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Computers  have  effected  a  comprehensive 
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status  and  the  potential  of  computer-assisted  instruction  (CAI)  iJi 
Qhepistry;  and  1(4)  inspire  further  examination  of  the  chemistry 
curriculum  in  light  of  recent  advances  in  'computer  technology.  The 
conference  report  is  a  collection  of  80  papers  divided  into  the 
*foll/owing  categories:  data  fitting;  curve  fitting  ,and  band  analysing 
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communications  in,  the  field  of  chemistry;  computer  control  of 
experiments;  the  use  of  A  Progi^aming  Language  (A?L)  in,  the 
undergraduate  curriculum;  and  information  systems.  (EMH) 
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•     A  STATfiMENT  OP  SUBPOSE 

.      \  / 

Sl«ptronlc  digital  and  analog  .coaputers  have  tfftctad  a  coaprebenaivt 
tranaf orwition  of  choMistry.    They  have  greatly  enhanced  model  building 
or  simulation,  data  refineaent  and  reduction,  analysis  of  data  in  terms 
of * models,  on-line  data  logging  and  control  of  experiments,  quantum 
chemistry  and  statistical  mechanical  calculations,  and  information 
storage,  handling  and  retrieval  including  dynamic  visual  display, 

e  .  -         ' ,  '  ^  • 

,  Sophistipated  technique  for  data  refinement  and  reduction, 
such  as  time-averaging,  curve  fitting  and  Fburier  transforms 
noK  can  be  systematically  apjplied,  thus  enabling  the  experi- 
menter .to  extract  much  more  ^useful  information  trtm  his  data, 

Vith  a  much  larger  variety  pf  nuaerical  techniques  available 
.  to  him,  the  ch^st  .is  no  Idnger  limited  to*  the  formuli,tion  , 
.   of  only  those  mathematical  ipodels  for  which  analytical 
,  aolutions  in  closed  form  can  be  found.    Models  can  be  made  , 
much  more  comprehensive. 

The  rapid  growth  of  the  mini-coaputer  industry  has  provided 
relatively  inexpensive  and  powerful  small  computers,  and 
integrated  circuit  module  technology  ekabTes  the  chamist 
to  design  ahd  cijhstrudt  his  own  interfiitces  "between  the 
control  computer  axvi  hds  e^rlment,  \ 

Other  non-nuBoric  applications  of  the  cbmputer  include 
techniques  for  the  systematic  search  of <|the  chemical 
literature,  'structure  detertiijtation  and  moleciile  identi*- 
fication  by  data  matching,  and  design  of  coaplex  organic 
synthesis, 

.  .  >^ 

A  major  prollfem  facing  the  chemical  profession  is,  that  most  chemists 
have  not  yet  become  reasonably  aware  of  how  important  the  computer  has 
become  in  chemistry,         .    .  , 

The  problem  is  further  compounded  in  that  chemistry  curricula  have 
not  yet  been  significantly  content  modified  in  order  that  these  new 
technique^  can  become  kA  integral  part  of  the  education  pf  the  chemistry 
student. 

The  problem  is  in  part  one  of  communication  between  those  few  developing 
these  new  techniques  of  chemical  problem  solving  and  those  many  who 
can  use  them  and/o^  teach  them. 

Consequently,  this  .Conference  has  been  organised  tot 

 provide  a.  nati^onal  forum  on  the  impsct  of  coaputerd  on  the 

way  chemists  do  chemistry. 
— ^encottcage  wider  investigation  of- the  use  of  computers  in 

chemical  research, 

 make  known  the  current  status  and  ths  potential  of  computer 

'  assisted  instruction,  and 
—  inspire  further  examination  of  chemistry  curriculum  consent  in 

reaction  to  computer  utilisaMon, 
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Data' Acquisition  and  -Refinement 
Intrtduction 


K.  Biemann  .  _  ^      .  * 

Department  of  Chemistry  '  ■   .         '  .  > 

Massachusetts  Institute  df  Technology  .  K 

\^       Cambridge, -Ma,      02139  ^  >  ,  . 


* .  The -ever. imjfeasiiig  availability  of  computers  to  the  experimental 
'  hn<f,t:heoretijC^i  chetnis^t  and  his /Increasing  willingness  to  use  them  in 

flti  efficient  .^md^^it^^t  mode  of  interaction  opens  avenues  not  previously 
^*  ap.CQasi>4?f  expex:irt6ntal  data  allows  ^ 

'.much  tt^orjTitiiigrto        .tb  be  derived  frpm  them  and  permits  the  design  of 

"dxpejA:rneat8  In,  ^  while:  minimizing  the' 

"  immBey'jof  datf|  t6' 'b.e  taken,  SoipSiQticated  error  analysis  further  increases 
/  tfie.;cpnf  iijei^^^  th^  jf^athematical  principle  of  the  , 

,iricth*od^  only  the  availability  of 

i:qmp^^t#ts  [tHaifee'j^^  now  on  a  large  scale;  further- 

^TjioVe^^'^the.arlthm^  that  approaches 

^'jlrpyQaXv^  would  be.  impractical, 

^ ,  .^^  t3?\^.  aclVent^  of  ^t^ejaaall.  IjaborAtoxy/^cqmpyt     brpnyht  the  chemist 
.  ev^n.^iloaw.  to  .utildzption^of  .these  ^devices  ^and  instruments 

'direifi^tly  coupled, to  a^  computer  apK  ^^cotaipg  ^itfqre  an^.motife  widespread,* 
..The;  yfesoVtltig  the.  acpumuiail6n\of  ".sjpat^^^  meaningful 

\^l^OUht<a  of  <lata  ^ithiji-^ short ''feirte^,  s^  Increase 
8^gnalf{fidi^npl?e  ratio  blf  .the  ti^trument  aj\d-even,  f^  control 
\oi .  tlie j^atameters.'dl.  th0  instrument  by  the  cqmtnitet.are^  ^^Gomtng  ^  J 

,An^^t)(ipr^  Widely  .a^        EiTfortg  of  €ht^  tyj^e' will  tittcjbyiijtfi^ 
//£jh(H/i^^iwle4ge  cqnceroing  chemlc?tl,^  Fottun^telyV-Cp^^puteir    '  . 

te'c^itj^l^yo^  retifieving:  and  correlating  the  resulting, 
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Current  laboratory  practice  includes  the  us<e  of  small  computers 
^directly  on-.line'  to  gtither  and'  process  experimental  data.    More  .sophis- 
ticated utilization  will  include  the  use  of  the  computer  ,to  effect  or 
cqntrol  the  experiment.  '  Simple  control  is  referred  to  as  open- loop',  ^ 
while  if  the  processed  data  is  used  to  control  the  experiment,  it  is 
referred  to  as  closed-loop.  •  This  paper  will  be  coricerned  with  the  tech- 
niqujes  involved  in  obtaining  the  data  and.  the  techniques  of  oii-lineVata 
ptodessing  that  relate  particularly  to  the  on-line  applications.  Most 

'.V 

>6f  th^  |i(iaterial*  herein  is  not  original,  but  is  a  collected  set  of  inf  or- 
matiop  pertinent  to  on-line  problems.  .  .  •      .     ~  . 

'  .*     The  ''first  item  to  be  considered  in  an  on-line  application  is  the 
iHpiit  -data.    Usually,  ^*1:his  is  an  analog  signal  that  must  be  sampl^^i  land 
converted  to  digital  form.   'Shamfon's^  sampling  theorem  jbells  -us  that  we 
must  ^anviple  al:  least  twice  per  period  of  the  highest  frequency  canponent 
of  information  to  be  able  to  re'construct  the  input  signal.    If  Signals 
of  greater  than  twice  the  sampling  frequency  eXist  and  are  sampled,  the 


■X-      '  ' 

Work  performed  under  the .auspices  of  the  U.  S.  Atonic  Energy  Commission. 


high'  frequency  sampXed  datlsf will  be  Indistinguishable  from  lower  frequerfcy^  \ 
data.    This  is  illustratl^  iri  Fig.  1^    This  lack dtstlnguishabilitj?  ^„ 

"between  .two  sampled  frequencies 'is  called  aliasing.    It  must  be  noted  .  .  . 

**  •  ^    ' ' 

that  no  correction  of  the  sampled  data  can  be  made  once  aliasing  has 

occurred.    The  manner  in-  which^the  fi^equencies  are  aliased  ,is  also  shown 
ih  Fig.  1  and  is  a  periotjic  folding  of  "the; frequency  spectrum  between  the 
Nyquist  frequency  and  zerc.    The  Nyquist  frequency  is  the  highest  ffe- 
quency  represented  at  the  given  sampling  rate.    Figure  2    is  an  example  • 
of  chromatographic  peaks  digitized  in  three  methods,  each  having  a  dif- 
ferent^, inputs  data  bandwidth.    The  bandwidths  are  respectively  IKHZ,  lOHZ 
and'lfiZ. \  The  sampling  rate  was  20  samples  per  second.    The  aliased  noise 

•     .  >   \  .       .       «  ;  , 

due  to  extreme  bandwidth  is  obvious  on  the  first  example,  and  60HZ  line 
frequency  aliased  to  near  zero  is  quite  apparent  on  the  second.    The  third 
, represents  the  true*  quality  of  ther^ata  shown  in  its  own  bandwidth. 

The  previous  examples  showed  the  results  of  digital  data  without 
concern  for  the  method  of  analog-to-digital  conversi*on.    The  previous  , 
conclusions  are  independent  of  conversion;  however,  some  problems  are 
conversion  pethod  dependent.    Most  current;  analog-to-digital  (fi/D)  conversion 
is  performed  by  one  of  two  methods.'  These  are  successive  .approximation 
and  integrating  converters.    Successive  approximation 'converters  function 
by  successively  comparing  the  input  to  a  digital-to-analog  signal  and 
setting  a  bit  at  a  time,  starting  with  the  most  significant.'*  bit.  Dependii^ 
upon  whe^ther  the  unknown  is  greater  .or  less  < than  the  comparison^  the  cor- 
responding  bit  is  retained  or  reset.  _This  is  shown  diagramatically  in  the 


^  From  UCRL-71T2'5^  "An  Aiitomated,  High  Sensitivity,  Thiree  Column  Ga'g 
Chromatograph,"    Frazer,  J.  W.,  Duyal,  V.  and  Anderson,  R.  E.  ' 


top  of  Fig*  5.    The  process  is  sequenced  by  an  Internal  clock^he  total 
time ^ to  make  a  complete  comparison  is  known  as  the  conversion  time.  The 
A/D  converter  is  observing  the  unkriown  voltage  during,  the  entire  conversion 

A 

time.    We  now  consider  what  happens  if  the^  input  changes  during  the  con- 
versioi?  process.    See  the  bottom  of  Fig.  5.    If  the  signal  should  drift 
past  a  conversion  Isvel  often  the  comparison  is  made;  the  succeeding  com- 
.  parii^ons'will  all" be  in  error.    However,  if  the  change  j.s  less  than  the 
voltage  corresponding  to  one  least  significant  bit  and  linear,- the  result 
will  be  in  error  by  no  more  than  one  bit.    Thus  the  ^amplitude  of  the  signal 
at  one  quarter  of  the  maximum  conversion  frequency  must  be  less  than  the 
one  least  significant^  bit.    This  can  *be  accomplished' by  \imiting  the  band- 
width  or  the -signal  prior  to  the  a/D  converter:    This  is  in  addition  to 
that  required  to  limit  the  bandwidth  to  less  than  theNyquist  frequency. 

An  alternative  method,  to  filtering  for  kefeping  the  a/D  converter 
ipput  constant  is  a  sample-and^^d  circuit.    This  is  essentially  an 
electronic  switch  followed  by  an  analog  memory.    The  operation  is  sh6wn 
diagramatically  in  Fig.  4.    Note  that  it  takes  a  finite  amount  of  time  to 
sample  the  system  and  achieve  the  input  value.    This  is  called  the^aperture 
time  of  the  sample -and -hold  amplifier.    Since  it  t)akes  energy  from  the 
input,  and  this  energy  is  stored  via  a  capacitor,  the  circuit  has  a  response 
approximating  an  integrator.    T};(}^s  we  should  expect  a,  response  that  falls 
off  linearly  with  frequency  with  a  turnover  corresponding  to  the  aperture 
time.  '  In  other  words,  the  maximum  bandwidth  seen  by  a  sample-aiid-hold 
circuit  is  approximately  one  divided  by  the  aperture  time,  and  totally 
independent  oV  the  sampling  rate. 
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The  amplitude  rfesponse  as  a  function  of  frequency  of  a  simple  RC 
integrator  is  showp  in  a  semi-log  plot  in  Fig.  5.    Note  that  this  is 
asymptotic  to  two  straight  lines  with  a  corner  frequency  equal  to  the  time 
constant  of  the  RC  integrator.    A  gated  integrator  has  a  similar  response, 
except  that  it  has  zeroes  corresponding  to  the  frequencies  when  integer 
multiples  of  the  signal  period  coificide  with  the  gating  period.  Thus, 
the  sample-and-hold  circuit  gives  bandwidth  reduction,  however  the  corner 
frequency  may  be  very  high  for  short  aperture  times. 

The  second  most  popular  a/D  conversion  technique  is  the  integrating 
voltmeter.    They  function  in  a  number  of  methods,  but. all  rely  upon  inte- 
gration of  the  unknown  signal  for  a  periods  of  time.    Presently,  the  dual- 
slope  integrator  is  the  least  expensive  and  its  conversion  method  is 
representative.    It  functions  by  analog  integration  of  the  unknown  voltage, 
V:/  for  a  fixed  time,  f rom^ t^  to  t^.    This  is  line  (a)  in  Fig.  6.    Then  a 
negative  reference  voltag-e  is  integrated  until  it  reaches  zero,  from  t* 


until  t^.    This  is  line         in  Fig.  6.    This  time  is  digitally  counted,  * 
and  is  a  direct  measure  of  the  integrator  voltage  Vg,  which  is  proportional 
to  the  input  voltage  V^.    Note  that  the  sample  integration  time  is  less  * 
than  the  conversion  time.    This  is  ^Iso  true  for  a  voltage-to-fr^quency 
type  of  integrating  voltmeter  operating  at  less  than  maximum  sampling  ' 
rate\  This  has  the  same  amplitude  response  as  the.  previous  example  of  * 
a  gated  integrator.    The  amplitude  response  has  zeroes  and  falls  off 
reciprocally  with  frequency, ^ within  a  corner  frequency  depending  upon 
the  input  gate  time.    Thus  the  corner  frequency  i-ncreases  as  the  duty 
cycle  decreases,  independent  of  the  sampling  rate.    This  is  .shown  in 
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Fig.  7.    And  consequently,  the  use  of  an  integrating^ analog-to-digital 
converter  does  not  insure  a  lack  of  frequency  aliasing.    In  fact,  as 
performance- (frequency-wise)  improves,  the  aliasing  becomes  worse. 

One  more  fact  about  analog-to-digital  conversion  should  be  pointed 
out.     In  an  a/D  converter,  the  decision  is  always  made,  by  an  analog 
discriminator,  a  logical  element.    A  bit  or  a  result  is  made  upon  a 
gr-eater-than  or  less-than  comparison.    This  result  holds  until  the  input 
becomes -greater  thaii  the  next  comparison  level,  although  the  input  changes 
continuously  between  the  two  comparison  levels.    Thus  one  least  signifi- 
cant bit  is  the  minijuum  comparison  level  and  any  intermediate  level  is 
not  rounded,  but  chopped  to  the  lower  comparison  level.    This  error  intro- 
duced is  kftown  as  the  quantization  error.    For  true  rounding,  the  input 
signal  must  have  a  voltage  corresponding  to  one-half  the  least  significant 
bit  added  to  it  prior  to  conversion,,  if  the  input  is  positive,  and  sub- 
tracted if  the  input  is  negative. 

Now  we  turn  to  techniques  by, which  we-  try  to  improve  our  data  after 
it  has  been  digitized  and  stored  within  the  computer.    The' first  tech-  « 
niques  can  be  applied  to  the  data  on  a  continual  basis  as  it  is  being 
acquired  while  t-he  latter  must  be  applied  to  the  entire  data  set  as  a 
whole.  ^ 

One  of  the  first  digital  techniques  to  be  used  to  improve  the  signal- 
to-noise    ratio  (s/n)  of  digital  data  was  that  of  ensemble  averaging. 
Ensemble  Averaging  consists  of  multiple  scanning  of  the  observation  in 
some  variable  domain,  and    summing    the  experimental  observations  corres- 
ponding to  some  value  in  the  variable  domain.    See  Fig.  7..    The  intensity 


15 


of  some  measurable  is  deterinined  as  a  function  of  some  parameter  (usually 
a  function  of  time).    Then  the  experiment  is  repeated  and  the  value  of 
the  measurable  corresponding  to  the  same  value  of  the  parameter  is  summed. 
This  is  performed  a  prescribed  number  of  times.    Since  the  desired  vari- 
able   is  symmed  in  phase  coherence  with  respect  to  the  parameter,  its 

signal  shouia  increase  linearly  with  the  number  of.  scans.    Noise,  on  the 

m 

other  hand,  is*  incoherent  and  should  increase  as  the  root  mean  square  of 
thq  number  of  scans;    Thue  the  signal  to  noise  should  increase  as  the 
square  root  of  the  number  of  sc^ns.  >  The  theony  has  been  analyzed  in 
detail  by  Ernst^  and  indicates  the  ultimate  improvement  available. 

In  adc^rtion  to  the  improvement  in  signal-to-n^ise  ratio,  there  is 
also  an  improvement  in  resolution.    In  other  words,  the  final  datfei  has 
a  resolution  greater,  than  that  of  the  A/D  converter  used  to  take  the  data. 
This  is  shown  diagramatically  in  the  upper  part  of  Fig.  8.    When  the 'Input 
has  a  given  comparison^ value,  the  digital  output  is  that  of  the  comparison 
value.  .If  the  input  is  less  than  that  of  the  comparison  value,  the  output 
is  one  least  significant  bit  less.    For  intermediate  values,  the  output  is 
that  of  the  next  lower  comparison  value.    This  merely  illustrates  the  quan- 
tiz&tion  error  mentioned  previously.    Now  if 'the  input  contains  random 
iibise  in  addition  ta  the  signal,  the'  input  corresponds  to  the  lower  part 
0f  Fig.  8.     If. the  noise  is  white  noise,;  iie.,  its  frequency  distribution 
if  uniform  or  its  fi&nplitude  distribution  is  Qaussian,  then  it  has  equal 
^i^obability  of  being  above  or  below  the  comparison  value.    When  a  number 
o£; measurements  are  taken,  the  average  will  be  the  mean  of  the  measurements 


^  Ernst,  R.  R.,  Rev.  Sci  Insts.  36,  1689-9^  (1965). 
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and  will  be  more  representative  of  Jhe  true  valu^  If  the  noise  is  truly  ' 
r^dom,  then  its  standard  deviation  will  be  proportional  to  the  square  * 
root  of  the  number  of  measurements,  and  since  the  absolute  error  is  pro- 
portional^ to  the  standard  deviation  divided  by  the  number, of  measurements, 
o/n,  the  resolution  will  improve  with  the  square  root  of  the  number  of 
scans.    The  proportionality  constant  will  depend  upon, the  standard  devia- 
tion of  the  noise  signal.  ^  j 

It  is  significant  to  point  out' that  for  data  with  extremely  good  S/N 
ratio,  the  improvement  in  resolution  will  not  exist.    A  perfect  coherent 
ram^will  ensemble  average  as  a  staircase.    However,  the  data  can  be 
improved  by  the  addition  of  a  pure  noise  signal  at  the  input  to  the  a/D  * 
converter.    The  data  will  approach  linearity  if  the- standard  deviation 
of  the  noise  is  large  compared  to  the.(juantizatioli  erro^*.  Particularly 
note  that  ensemble  average  does  not  eliminate  the  quantization  error, in 
any  way.    The  data  approaches  the  true  value  if  the  added  noise  has  a 
mean  of. .one  half  the  least  significant  bit  of  the.A/D  converter.    In  all 
caaes  of  ensemble  averaging^  the  data  taking  and  the .experiment  must  be 
synchronized,  e;Lther  by  computer  control  or  by  a  trigger  signal  from  the 
experiment.  • 

For  unsynchronized  data  containing  a  periodic  component,  a  different 
digital  technique  is  available.    This  is  the  process  of  correlation.  Sincje 
»^ihe  periodic  component  is  coherent  with  its  own  period,  the  correlegram  will 
show  a  periodicity  with  the  same  period.    The  notse  in  the  signal  is  coherent 
only  in  the  region  of  zero  phase  shift,  and  thus  will  contribute  to  the 
correlegram  only  near  the  origin.    Thus  the  signal  value  for  large  values 
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of  signal  difference  will  show  the  same  signal  to  noise  improvement  as 
occurs  in  ensemble  averaging.    Since  this  periodicity  can  be  obtafned 
by  source  modulation, .this' technique  gives  S/N  enhancement  for  single 
value  measurements. 

One  additional  on-line  technique  exists  for  the  Improvement  of  data. 
The  technique  is  called  digital  filtering.    Although  long  known^  it  was 
.first  introduced  to  the  chemical  literatvire  by  Sanitsky  euid  Goulay^^ 
The  technique^ they  described  has  been  known  by  m^ny  names: '  digital 

: 

$ 

filtering,  convolution,  least  squares  fitting,  smoothing,  and  mqre.  '  v 
The  procedure  is  best  called  digital  fij!tering  since  it  is  directly 

.  analogous  to  analog  filtering  of  the  analog  .data  prior  to  digitization. 
We  can  implement,  in  the  digital  domain,  the  same  result  as  aji  analog  - 
filter  in  analog  world. .  We  can  have  low  pass,  higH  pass,  band  pass  and 
^  notch  filters.    In  addition,  we  can^'also  implement  filters  for  which  no 
counterpart  exists  in  the  analog  world.    As  a  quicks  example,  for.non-. 

*real-time  systems,  we  can  implement  filters  with  no  frequency-depenjient 
phase-.shift.  However,  let  me  repeat,  that  no  digital  filter  can  compen- 
sate for  frequency  aliasing  errors.  ^ Once  frequencies  are  aliased,  they 
are  digitally  indistinguishable  from  their  low-frequency  counterparts.  ' 

'Thus  the  frequency  domain  of  the  digital^ filter  lies'  below  t.he  Nyqui'st 
frequency.    However,  this  does  not  deter  Its  usefulness  in  the  genferal 
case,  as  for  example  in  line  frequency  rejection *or  in  transmission  line  ' 
frequency  compensation.    The  corresponding  digital  filter  may  not  be 


c  ■         •  - 

Sanitsky,  A.  sttid  Goulay,  M.  J.  E.,  Anal.  Chem.  36,  l627-3g  (1964). 
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^  extremely  difficult  to  implement,  while  its  analog  counterpai-t'  can  easily 
be  almpst  impossible  for  a  complicated  frequeiicy-phase-amplitude  charac- 
teristic. 

Digital  filters  are  easily  implemented  as  a  sum-product  of  data.  The 
data  may  be  either  direct  input ^points;  or  computed  input  points.'  This 
divides  the  filters  into  two  classes — ^finite  memory  'filters  which  compute 
on  input  data  only,^  and  infinite  memory  filters  that  compute    upon  pre- 

.  viously  computed  filter  points;    The  difference  ^between  the  two,  is  easily 
illustrated  in  Fig.  9  that  shows  the  corresponding  responses  to  a  step 
function  of •  the  two  two-point  filters.   .Since  these  filters  all  correspond 
to  a  moving  window,  they  all  represent  a  contribution.    Digital  filters 
are  most  easily  described  in  terms  of  their  Z-transform.    Since  that  is 
beyond  the  scope  of  this  paper,  the  reader  is  referred  to  the  literature.^ 

■  However,  it  'is  sufficient  to  say  that  the  filter  consists  of  a  linear 
equation  relating  the  output  to  a  sum  of  products  of  coefficients  times 
previous  input  or  output  points.    Thus  the  filter  is  a  f inite * difference  * 
equation  and  is  directl)  analogous  to  differehtial  equations  in  the  analog 
vorld.    As  an  qxample,  the  simplest  five  point  smoothing  filter  is: 

Xn  =  -  yn-1  +  ~''yn-2  +  ^  yn-3  +  ^  yn-4  +  |  yn-5 

The  corresponSing  lea^^t  square  filter  is: 

Xn  =  ^  yn-1  +  g  yn-2  +  12  yn-y+  g  yn-4    ^  yn-5 


^ Jury,  E.  I.,  "Theory  and  Application  oX  the  Z-Transform  Method,"  Wiley  (1964). 
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Although  it  cannot  "be  implemented  in  the  on-line  case,  the  numerical 
Fourier  transform  is  another  example  of  a  numerical  filter.    Once  data  is 
transformed  from  the  time  to  frequency  domain,  any  phase -frequency  operation 
is  a  simple  vector  dot-product  operation.    The  amplitude  or  phase  at  -any 
frequency  can  be  readily  multiplied  by  any  constant  or  variable  to  produce 
the  desired  result.    However,  the  reader,  is  cautioned  against  arbitr^y 
constants  unless  he  is  familiar  with  the  consequences  in  Fourier  Series. 
In  particular,  the^roblems  with  truncation  and  the  Gibbs  Effect  may  pro- 
duce results  other  than  desired.    Other  types  of  numerical  transforms  can 
be  employed  to  further  treat  data,  but  one  should  be  very  familiar  with 
the  transfonn  properties  prior  to  usage.    A  known  transfo;rm  does  not 
necessarily  produce  desired  results. 

In  conclusion,  numerical  treatment  of  on-line  data  is  severely 

restricted  by  the  user's  knowledge  of  numerical  properties  involved.  ' 

Some  are.  viery^  obscure  indeed,  and  very  difficult  to  anticipate.  Thus, 

,it  is  advised  tjiat  the  user  first  investigate  known  functions,  followed 

by  function  similar  to  his  dat^  set  prior  to  experimental  investigation. 
♦ 

e  \      ^  ^, 

To  summarize,  I 'quote  R'.  Hamming,  ^*The  purpose  of  computing  is 'insight, 


not  numbers.'* 


Hamming,  Richard,  "Numerical  Methods  for  Scientists  and  Engineers," 
y.oGrav-Hill,  1962.     *  « 


1-12 


BIBLIOGRAPHY-- 


Blackman,  R.  B...and  Tukeg,  J.  W.,  "The  Measurement  of  Power  Spectra," 

Dover,  1958. -  "  .  >.  . 

Fjaanks,  -L."  E. ,  "Signal  Theory,"  Prentice-Hall,  I969.. 

Gold,  Bernard  and  Rader,  Charles  M. ,  "Digital  Processing  of  Signals,"  " 
McGraw-Hill,  1966.  -  " 

Hamming,  R.  W,  "Numerical  Methods  for  Scientists  and  Engineers  " 
McGraw-Hill,  I962....  ,     ^  '  ~, 

Jenkins,  G.  W.  and  Watts,  D.  G.,  "Spectral  Anedysis  and  Its  Applications," 
Holden-Day,  1968.  •  ' 

Korn,  Granino  A.,  "Random-Process  Simulation  and  Measarements,"  McGraw- 
Hill,  1966. 
^> 

Monroe,  Alfred  J.,  "Digital  Processes  for  Sampled  Data  S:^stems,^ Wilev, 
1962.     •  ^      '  ^ 

Morrison,  Norman,  "Introduction  to  Sequential  Smoothing  and  Prediction  " 
McGraw-Hiil,  1969.  '  ,  ^         •  .  . 

Ralston,  'Anthony,  "A  First  Cause"  in  Numerical  Analysis,"  McGraw-Hill,  1965. 


J 


1-13 


FIGURE  CAPTIONS 

Figure  1.  Sampled  Aliasing 

Figure  2a.        Bandwidth  Chromatogram 

Figure  2b.        lOH    Bandwidth  Chromatogram 

z 

Figure  2c.        IH    Bandwidth  Chromatogram  * 

Figure  3a.        Successive  Approximation  A/D  Converter 

Figure  3b.        Successive  Approximation  Converter  With' Drift 

Figure  4.  ^ample  and  Hold  Ope.ration 

Figure  5.  Amplitude  Response  of  an  RC  Filter 

Figure  6.  Dual  Slope  Integrating  Analog-To-Digital  Converter 

Figure  7.         Amplitude  Response  of  Gated  Integrator 

Figure  v^8a.        Quantization  Error 

Figure  8b.        A/D  Conversion  with  Dither  _  > 

Figure  9  Impulse  Response  of  Two-Point  Finite  Memory  Filter 

Figure  10.  .   Impulse  Response  of  Two-Point  Infinite  Memory  Filter, 
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Abstract;  A  brief  treatment  of  the  statistical  moments 
of  data, -'estimation  of  imprecision,  and  propagation  of 
error  usi^g  numerical  analysis  is  given.     A  method  for 
taking  input  data  with  their  estimated  imprecisions 
and  propagating  the  imprecision  through  a  computer 
program  to  arrive  at  a  final  result  and  its  estimated^ 
imprecision  is  developed.     The  theory  is  developed  for 
cas.es  where  the  iinprecisions  ^in  the  data  are  uncorrelated  ' 
or  independent  of  one  another.     The  method  allows  for 
distributions  of  error  which  are  nonGaussian,  but  is 
illustrated' with  a  normal  case.     Current  programs  can 
easily  be  modified  to  incorporate  the  error  analysis 
subroutine,  with  some  nuisance  in  notation  and  input- 
output  format.     Computation  time  will  increase  by  a 
factor  of  ten  for  error  propagation  computations. 

Introduction :     The  computer  systems  now  av^iilable  often 
free  us  from  major  concerns  of  minimizing  computation 
time.     Attention  can  now  be  turned  to  using  the  computer 
for  a  more  complete  use  of  the  data,  namely,  for  the 
determination  of  the  uncertainty  iri  the  final  result. 
The  six  or  eight  figure  numbers  produced  by  the  computer 
printout  are  completely  unrelated  to  the  number  of 
digits  to  which  the  number  should  be  "rounded  off", 
and  a  casual  analysis  of -error  can  be  quite^  misleading. 
The  error  propagation  method  described  below  can  give 
t^e  researcher  a  well  founded  estimate  of  the  precision, 
of  his  results.     At  the  same  time  it  provides  a  tool 
for  determining  which  factors  in  the  data  are  leading 
to  the  most  serious  imprecisions. 

Instruction  in  the  use  of  computers  to  reduce  data 
taken  in  undergraduate  laboratories  is  incomplete  unless 
error  propagation  is  included.     Students  can  best 
appreciate  the  problem  of  "weak  links"  in  the  data  chain 
and  the  danger  of  overconf idence  in  computed  results  if" 
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the  estimate  of  imprecision  is  carried  out  aloi^g  with 
the  main  computation.     The  method  below  was  designed 
with  ^the  begiijner  in  mind.     While  it  can  be  expanded 
*to  serve  more  sophisticatec^  needs,  it  can  be  applied 
almost  as  an  added  packaged  subroutine  to  most  simple 
programs ^used  in  the  physical  chemistry  laboratory. 

Approaches:  Various  techniques  have  been  developed  to 
cope^with  the  problem  of  error  estimates . (Hahn  and 
Shapiro^)  First:  the  Monte  Carlo  technique  requires  that 
a  number  of  program  runs  be ^made^  each  with  a  different 
set  of  input  data  chosen  from  a  statistically  represen- 
tative set  of  data  values  chosen  about  the  mean  valuer 
of  the  inpiK  parameters.     The  distribution  of  resnalts 
can  then  be  analyzed  to  determine  the  imprecision  of 
the 'mean  value.     The  method  is  rather  lengthy  and  not 
easy  to  .insert  into  current  programs. 

'Second:  A  step  by  stop  algebraic  determination  of 
the  propagated  error  m^y.be  used.  CStrobel)  Some  skill 
at  taking  derivative^  is  required,  and  the  computer 
program  ma^  need  considerable  expansion  to  evaluate  - 
the" resulting  equations.  Third:  A  numerical  different- 
iation** about  the  mean  value  of  a  computation  can  be 
used  in  a  Taylor  series  expansion  about  that  point  to 
get  statistical  moments  about  the  mean  value.  This 
combines  the  best  of  the  two  mjathods  above  and  results 
in  a  more  convenient  modification .  to  current  programs^'- 

Theory  of  Moments:'  The  statistical  moments  of  a  set 
of  N  values,  x^,  ab.out  zero  are  ^defined  (Kendall  and 
Stuart)  as      '      *  ^  -  -  ' 

where  j  is  an  integer  larger  than  ze-ro.     The  first  of 
thdse  is  called  the  mean,  x  =  M, .  The  moments  of  the  • 
set  of  data   (in  this  case  we  mean .  repeated  measuSfements 
of  the  same  experiment,  differing  in  value  only  because 
of  experimental  imprecision)  about  the  mean  are  ^ 
particularly  useful  for  unimodal  distributions  which 
have  a  monotonic  decrease  of  probability  of  observation 
as  the  possible  value  gets  farther  away  from  the  value 
most  likely  to  be  observed,  the:  mode;     In  suCih  cases 
the  shape  of  the  distribution  can  often  be .defined  to 
satisfaction m  terms  ojf  the  firs-t'  few  moments  about 
the  meap.     These  are  defined  as 
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Here  L]_  is  zero  by  the  definition  of  X.     The  variance , 
1»29  is  the  sqjuare  of  the  standard  deviation  ,  or  .  The 
lack  of  symine;try  about  the  mean  is  measured  by  L3.  A 
normalized  quantity  called  the  skewness  is  defined  as 

p  =  L3/L2  '  while  kurtosis  is  defined  as  K  =  ^4/^2  * 

The  .latter  is  a  normalizecj  measure  of  peakednesSf  , 
that  is,  t^ow  sharply  the  distribution  peaks  near  the 
mean^  relative  to  the  height' in  th^ "wings <far  from 
the  mean.  .  . 

The  M  values  are  somewhat  easier  to  compute,  but 
the  L  values  are  simply  derived  from  them. 


<  Estimation  of  Moitients  for  fiata;     Rather  than  determining 
the  statistical  moments  for  each  parameter  (at  each 
value  in  its- range)  ekperimentaily ,  one  can  make  some 
reasonable  estiihates  about  the  limits  of  expected  error 
for  each  parameter*'    For  ^acb  parameter,  use  the  me'an 
value  y:  (the  value  observed)  ,  the  lowest  expected  value 
A  (smaller  than  99.9%  of  the  values  expected,  to  be 
pbsefved  if  replicate  observations  were  made) ,  and  the 
highest  expected  value  B   (larger  than  99.9%  of  the 
anticipated  replicates).     These  can  be  used  with  a>simpie 
and  general  distribution  ^^orjn  which  is  readily  related 
to  the  statistical  moments.    (Ellenberger )  • 

This  polynomial  distribution' h [x]  is  related  to  the 
normal  distribution  of  the  standardized  norma-1  ^riate  z, 
which  has  a  distribution  g[z]  ,  by  the  relation  '  -B'Vi 

X  r  9  2^  +        *  cz  -i^  d  ,  and  note  that  -  ' 

Specification  of  the  coefficients  and  relation  to  the 
L  and  M  values  is  most  easily  accomplished  through  some  ' 
more  relations ,  -derived  . from  A^B,  and  y:  above.  The 
.    distribution  span  S=B~A  is  related  to' the  moments^by  ' 

-^C^        j2^7^/S'L\/^JS'- 2ja)      which,  re  duces  to  eir-'for 

&ymme,tric  distributions.     The  skew  ratio  R=(y-A)/S  is 
defined,  in  a  complex  manner,  but  for  13.3<R<0.7  it  cart 
'  ])e  approximated  by  the  form 
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<7>        ^  T  ^  *         '       From  these  we  can  get 

<.>    Ly-  S^r^T  •      ■  . 

<*»>    L,=  'rft"^)^/''       for  0.3<R<0.7,  and  then  get 

The  coefficients  relating  x  to  z  are  defined  as 

<"> ,  5  -    [ifnT^TfTi  -  j]  • 

Kn>     t      S_  (i-Zd) 

</y>  ^  c/  r       ^  . 

This  procedure  may  be  skipj3ed,  of  course,  if  a  symmetric 
distribution   (L3=0  or  R=0.5)-is  assume^i,  in  which- .case 
the  standard  deviation  may  be  estimated  directly. 

Theory  of  Propagated  Error;     If  a  computed- result  y 
'depends  on  several  input  variables   {xi,X2, •  *x^, . .)  whose 
errors  are  uncorrelated,  then  the  statistical  moments 
of  y  can  .be  calculated  using  tlie  functipnal  dependence 
y=ftx*s]  of  the  resul^t  on  the  input  parameters  and  the 
known  or  estimated  statistical  moments  of  the  x*s.  This 
is  accomplished  (Scarborough),  by  expanding  the  function 
in  a  multivariate  Taylor  series /    Using  the  notation 

for.  the  mean  of  th^  mth  parameter,  and  expanding  to 
second  order  only,  we  get  . 

K  .  '  'V 

3a 


1-25 


Computer  Assisted  Valuation  of  Error 

The  first  statistical,  moment  'about*  zero  is  then. 

The  second  moment  about  the  meah  of  y  is 

and  the  third  and  fourth  moments  about  the  mean  are 


Thus ;  given  a  known  function  and  the  statistical 
moments  for  the  input  parameters,  the  appropriate 
partial  derivatives  may  be  obtained  analytically  and 
used  in  equation  in  the  computer  program  to  get 

the  moments  of  the  computed  result.     An  alternative 
route  makes  use  of  a  subroutine  to  determine  the 
derivatives  numerically  .by  varying  the  input  parametei: 
values  about  their  mean  values.     The  latter  technique 
is  chosen  for  furthjsr  development  here,  since  it  requires 
no  analytic  dif  ferent'iation  .with  the  consequent  added 
programming.     It  can  be  used  with  most  current  programs 
if  minor  alterations  in  input' and  output  format  are  made 
and  a  simple  and'^general  subroutine  is.  added. 

Numerical  Differentiation:     Evaluate  y=:f[Xj^'s]  at  several 
Values  of  one  of  the  variables  x^^,  these  values  spaced 
equally  by  hj^^  about  the  mean  TTnif  keeping  the  other 
variables  at  their  mean*  values'  Yy^.     Differences  between 
the  y  values*  thus  obtained  are  labelled, 

<x/>     4^^.  c  At^.^^  -  'or,  in  general, 

<ClO    A^^.  -  A^  '^c^r  "  ^^  '^i  •    '^^^  partial  derivatives 

are  related  to  these  differences  by  infinite  series,  but 
a  truncation  of  these  series  to  the  first  few  terms  gives 
satisfactory  results.     For  the  first  few  derivatives, 
the  truncated  series  approximations  are  (Scarborough) 
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Computed  for  each  dnput  parameter/  using  hjf^=  V^^m'^^ 
some  other  convenient  interval,  and  are  combined  as 
indicated  by  equations  .If  there  afa.  further 

computations  where  y  is  involved  'and  error  propagation 
-is  desired,  these  results  can  be  incorporated  in  the 
next  stage  of  computation. 

Notes  on  Implementation;    *The  program, given  in  the 
Appendix  calls  a  standard  subroutine  whenever  there  « 
is 'an  error  propagation  to  be  done  in*  the  main  program. 
The  calling  statement  replaces  the  l^ne  used  for  the 
equation  computation  in  the  old  (unmodified,^  without 
the  error  propagation)  program.     The  equation  is  now  ^ 
listec}  in  a  second  subroutine  which  is  used  in  the 
numerical  dif f erentiatiqn.   .The  equations  thus  listed 
are  in  a  standard  form,  y(N3)=some  function  of  x(i)'s. 
This. simplifies  the  form  of  the  standard  subroutine, 
although  it  has  the  unfortunate  effect  of  obscuring'  the 
•meaning  of  the  program  to  the  casual  reader.  That  is, 
varicibles  which  in  the  original  program  were  easily 
distinguished  because  different  letters  wer^e  used  'to 
represent  them  will  have  to  be  changed  'to  x(l) ,x(2) ,etc. 

L 

The  program  given  here  uses  only  the  mean ^ and  the 
standard  deviation  of  input  parameters  and  computes 
only  these  two  measures  for  the  output  results.  Thus 
only  equation  16  and  the  first  term  of  17  are  used. 
The  readin  format  of  the  old  program  and  the  ^scaling 
and  printout  of  the  old  program  must  be  changed  to 
accomooate  the  new  symbolism  of  x*s  and  y"s  and  to 
make  room  for  the  standard  deviation  next  to  the  mean. 
A  more  complete  moment  analysis  would  require  fndre 
space  and  computer  time,  but  would  be  equally  straight- 
forward to  setup.  ^ 

Appendix:     The  following  program  is  written  in  the 
BASIC  language  for  the  Dartmduth  Time-sharing  Netwojrk. 
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100  REM  THIS  PROGRAM  CONVERTS  THE  READINGS  'qf' A  N  / 
102  «  'IMPEDANCE  BRIDGE  TO  PARALLEL  R  AND  C  DATA. 
104     'PUT  DATA  IN  AT  LINE  3000  FFt  FREQ(KHZ>*  D  READING* 
106     'C  READING.  ♦♦♦PROGRAMMER:  R.D.NELSON-05753 

110  LET  M=8000^ATN(1 >       'A  CONSTANT 
200  DIM  X(30>*E(30>*M(5*5> 
205  DIM  Y(10> 

210  READ  Nl   •# INPUT  PARAMETERS  WITH  ERROR 
215  R^A^\N4  'DUMBER  OF  DATA  SETS  TO  RUN  THROUGH 
220  FOR  1=1  T0,N1 

230    READ  X(I)*E(I>   'MEAN*  STANDARD  DEVIATION 
240    NEXT  I         .  ' 
245  REM  FREQ*a*C  ARE  INPUT-S 

250vLET  N3=l   'CALC  H"F^D  ■ 
255  GOSUB  4000  'TO  AVOID  ERROR  -CALC*  USE  60SUB  5000  / 
260  LET  N3-2  'CALC  CP»C/(l+Hr2> 
S65  GOSUB  4000 
270  LET  N3=3  'CALC  RP»( 10t6>/(M^F^D^CP> 
275  GOSUB  4000 

280  IF  ABS(X(5)-X(3)  ><0.  UX(3>  THEN  400 
290  PRINT  "LARGE  CORRECTION  TO  CAPACITANCE" 
400  PRINT  "FREQ( KHZ )»••;. - 

402  LfiT  N2=l  <  ■ 

404  GOSUB  6020 

410  PRINT  "D  READING^"; 

412  LET  N2a2 

414  GOSUB  6020 

420  PRINT  "C  READING-") 

422  LET  N2»3 

424  GOSUB  6020 

440  PRINT  'TARALLEL  CAPACITANCE^PF)^"; 

442  LET  N3=2  -  - 

444' GOSUB  600<5  ' 
450  PRINT  "PARALLEL  RESISTANCE<'MEG0HMS>»^'; 
452.  LET  N3=3    -  . 

454  GOSUB  6000      '  »  *    .         "  '   '  - 

456  PRINT 

460  LET  N4=N4-1  "    ~1      •  . 

470  IF  N4>0  THEN  220  - 
500  STOP  •  f- 
3000  DATA  2»2 

3010  DATA  1*0.001*   0.00015*0.00001*  210.567*0.008 
3020  DATA  10*0.01*   0.13*0.04*  210.567*0.002^. 

4000  LET  Y=Y1=0  'SUBROUTINE  FOR  MEAN  AND  STD  DEV  OF  CALCULATION 
4010  FOR  1=1   TO  N1+N3-1      'EACH  POSSIBLE  PARAMETER 
4020     LET  E=E(I>/2     'SET  INCREMENT  SIZE 
4030    LET  X=X(I)-3*E 

4040    FOR  K=l  TO  5   'THE  SET  OF  Y'S  FOR  ONE  PARAMETER  VARIATION 

4050      LET  X(I>=X+K^E 

4060       GOSUB  5000 

4070      LET  M(K*1>=Y(N3) 

4080      NEXT  K 
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)  •  " 

*4090  FOR  L=2  TO  .5  'DIFFERENCES  IN  THE  Y  SET 

4100  FOR  K=l  TO  6-L 

4110        LET  M(K,L)nM(K+l,L-l)-M(K>L-l)   'SEE  EQ  22. 
4120        NEXT  K 

4130  Next  l  .  ,  ^  . 

4140  let  x(i)-x+3*e    'reset  the  x  '*  . 

4150    LET  Y-Y+M(2*3)-M(l,5)/12  'SEE  EQ  24. 

4160  LET  Yl«Yl+(M(2,2)+M(3*2)-(M(l,4)+M(2,4>)/6)t2  'SEE  EQS  17*23 
4170    NEXT  I 

4180  LET  X(N1+N3)=2*Y+M(3*1)     'THE  MEAN  OF  Y,   SEE  EQ  16 

4190  LET  E(Nl+N3)=SQR(Yl )   'THE  STD  DEV  OF  Y 

4200  'RETURN  ^ 

5000  ON  N3  GO  TO  5010*5020,^030 

5010  LET  Y(1)»X(2)*X(1) 

5015  RETURN 

5020  LEJ  Y<2)=X(3)/( 1+X(4)t2) 
5025  RETURN 

5030  LET  Y<3)=1000000/(X(5)*X(2)*X(1)*M) 
5035  RETURN 

6000  PRINT  X(N1+N3);"+-";E(N1+N3) 
6010  RETURN 

6020  PRINT- X<N2);"+-";E(N2) 
6030  RETURN 
9999  END 


RUN         (on. the  Dartmouth  Time-sharing  Network) 
CAVE    ,         12  APR  7i  14»57 

«  ^ 

FREQCKHZ)"*'!  .  .+-  0.001 

D  READING"  0.00015  +-.O.ppOOt 

C  READING"  210.567*. +-  0.002 

PARALLEL  CAPACITANCECPF)-  210.567  +-  2.00113,E-3 
PARALLEL  RES I  STANCE (MEGOHMS)"  5061.33  +-  335.965 

LARGE  CORRECTION  TO  CAPACITANCE 

FREQCKHZ)"   10.   +-  0.01 

D  READING"  0.13  +-  0.04 

.C  READING"  210.567  +-  0.002 

PARALLEL  CAPACITANCECPF)"  85.3291  +-  30.2851 
PARALLEL  RES I  STANCE (MEGOHMS)"   1.75014  +-  0.671412 


TIME:  0.458  SEC.  (includes  compilation  time) 
READY^ 
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CORRECTED  FLUORESCEIICP  SPFICTRA  BY  COMPUTER  POSTPROCESSING* 
V,  A\an  Mode,  D^vid  H,  Sipson,  Carolyn  A,  Pa.ile2r 

Lawrence  Radiation  laboratory,  University  of  California 

Livf^rmorp,  California    9^550  '  ' 

INTRODUCTTON 

The  availability  of  relatively  low-cost,  high-quality -fluor- 
escence instruinentation  has  enabled  the  routine  usp  of  fluorescence 
analysis  by  many  laboratorifts.    In  quantitative  analytica]  work, 
one  need  not  be  concerned  with  +'he  sppctral  response  functions  of 
thp  Instrument  because  analysis  is  perfonned  at  one  wavelength  and 
with  an  empirically  determined  calibration  curve.    However,  if.*"  the 
instrument  is  used  for  qualitative  analysis  ox,  for  ouantum  effi- 
ciency determinations,  a  corrected  spe-^trum  is  required,  (A 
corrected  sT)ectrum  is  defined  as  one  for  which  the  observed 
ir+ensity  data  have  been  coi^^cted  to  reflect  the  spe^'+T^l  res- 
ponse of  the  exciting  source,  the  optical  system,  and  the  phntnmul+i 
pTier-detection  system.) 

Rv«»ry  instrnjment,  even  "^-^oti  th^  <?a">«  *Tiam'Tac**'ur'^'^,  has  ^ 
unique  total  sp^ctiT*!  r^^sT^nns*:* /   T^Hoi^'^ore ,  th'=*  comp^^'Hson  of 
cpeo+ra  between     fft^rr>r\'¥  ins+ru'"pn+''       with  pu'bii,??hpd  Hat?? 
can  ^<=*  '*.^'='"'<=*ad1np  or  ir^p^sih'''^,         -^^  ur'^^'^T+unate  th***  t^.^t , 
^•^  ^^octy>l  c^at'^  ^  ^r*^'" !  mt*^  havo  pot  bee'^  cc'^^cted, 

-•iior^f  r^'^^'^ns  for  ■^hi'"  omiss'^o'^  hoy*^  v^r^r^n  +^f»  vpt*^'"  *^^^h 
4y>ef»njmp>^tption  for  ^iilt'^ma+^'^ally  ^^^rrluo^ng  cn'»^prt»H  d'^t**^ 
+ho  1  qho'»^i '^t''*  eff'^rt  inv^Tv^d  in  mpmiiillv  <*o'»''^c+iif^  •^>*om  ^ 

pfi/^>»o  ^onv^^t on^ T  instrument'', 

Sevo'h*"'  '>'r+irTps  ^^vp  hp^r  ptibl^sbed       "**he  O'^irec+^on  of 
f ii)^>*o<;rpn^p  s'^pcpty^  '>nd  t^^e  tech'^iqyen  for  dptermininf  the  ^^li^ 
bration  factors,  "       Thp  rrating  ef-f^iciegpies  of  the  Pxci+?»tT<^*^ 
5»rd  emiss*''"^  '^onochromatcrs  are  =essentiany»const'*'^+  over  their 
lifptinips,  '  Once  dpte"nnin'=*d ,  +be  appropriate  g^tinf  cnrr«^+-'on 
factors  nan  b*:*,  applied  bv  means  of  electrical  or  iriechanical 
circtiits  o"^  by  postprocessing  of  the  da+a  on  a  corpv.ter.  The' 
response  ftinction  of  the  photomu]tir>?ler  is  also  ^latively 
constan+  and  can  be  checked  pe'^iodical^y  wl+b^a  «5tandard jl^mr*  of  ^ 
known  output.    Like  the  grating;  tJOTrrectlons,  tbe  -oho+oTnuTti pTier 
response  C'^'n:ection*=?  '^an  be  mad^  durinp*  +be  rins+'^'^ocer^sinr  o"f*  +bc^ 

Althou!Th  systeips  cj^p  bp  desirn^^^'d  t^ai  win  "ipir^  <*<^'prp^+-< /^r%c 
durinf'  qa^rnl-^ «^<t^  -the  pT«*r»t'»^ica^  a^^^^or  Tno^hpinioal  sv^+ej^s  ■rq^i'^rp'^ 
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are  cc)mplex,  expensive,  and  require  continual, maintenance.    If  it 
is  necessary  to  replace  a  photomultiplier  tube,  it  jnay  be  necessary 
to  recalibrate  the  entire  correction  system;  a  time-consuming 
operation.    However,  these  response  functions  are  easily  handled 
by  a  computer  arid  can  be. quickly  changed  to  reflect  changes  in 
instrumentation , 

^    To  obtain  a  true  corrected  spectrum,  it  is  also  necessary  to 
correct  for  the  spectral  distribution  of  the  source.    For  post- 
processing of  the  data,  a  simple  technique  is  required  for  deter- 
mining the  spectral  distribution  of  the  excitation  source r  this 
distribution  must  remain  stable  for  several  hours,  and  abnormal- 
iti^  flicker  or  arc  wstnder)  must  be  quickly  detectable. 

We  have  developed  techniques  to  fulfill  these  requirements  for 
postprocessing  spectral  data. 

Our  calibration  and  monitoring  techniques  combined  with  rapid 
postprocessing  of  the  instTnimental  data  has  enabled  us  to  produce 
high-qizality,  corrected  spectral  data  with  conventional  instrument?? 
for  a  minimum  capital  expenditure, 

INSTRUMENTATION 

An  Aminco-Bowman  spectrophotofluorometer,  Model  SPP  4-8203, 
was  modified  to  provide  digital  output  -for  wavelength  and  inten- 
sity.      With  the  output  from  the  wavelength  digj.tal  voltmeter  as 
a  trigger,  a  logic  circuit  was  designed  to  record  the  jfluorescence 
intensity  data  on  punched  paper  tape,    A  block  diagrsCm  of  the  equip 
mert  is  shown  in  Pig,  -1  and.  a  logic  flow  chart  in  Fig,  ?, 

Although  the  photomultiplier-microphotometer  (Aminco-Bowman 
Model  10-26)  has  rated  output  of  1  V  at -100^  meter  deflection, 
the  output  was  found,  to  be  essentially  linear  up  to  1,8  V  (an  80^ 
overrun  of  the  me+dr  indication).    Thus  the  instrument  ranges  are 
effectively  expanded,,  and  data  are  provided  which  shDuld  not  be 
lost.    With  a  simple  fifth  level  ASCII  coding,  it  was  possible  * 
to  obtain  four  significant  f l^aires  of  data  while  recording,  only 
the  three  least  significant  figures.    Therefore,  intensity  d^^ta  , 
from  0,000  to  1,99^  ^^ere  available  with  a-nln-^.num  of  punchefd  ' 

o-i+put,    ^  ^  ^  :     .  . 

To  produce  the  highest  density  of  data^^r  character;: 
punched,  several  arbitrary  limitations  were  iinposod  on  th^;  '  ' 
system.    A^^pectrum  would  always  star**  at',. 200^'™  and  i^.tid  9)t.  > 
799  nm  (600  data  ]^ints  or  40  Teletype  lirifes  of  15  wdrd^  ;ea!ctl  V 
and  data  would  , be  recorded  at  a  fixed  inopemettt .  of  ;1  nm'*  .tln^dsr  .  . 
these  conditions,  it  was^ necessary  to  generate  iSfavelohgt/i-^^ 
intensity  pairs;  intensity  data  alone  wer^  .^sufflcierit*: ' 'It  y^s,  , 
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necessary  to  ensure  that  the  ayste'n  wWd  record  only  one  inten- 
sity datum  at  each  wav^len^th  and  would  terminate  recording  if  a 
datum  was  missed.*   In  addition,  the  manual  termination  of- a 
.    spectrum  prior  to  tha  normal  79^.  nm  had  to  be  accomplished  in 
such  a  way  as  to  retain  .tape  format.    A  l5-Hord  TTY  line  had  to 
be  completed  and  the  associat-ed  control  characters  punched  to 
ensure  compatibility  with  the  computer  hardware  and  software. 

Our  lo^ic  circuit  (Fig.  2) 'provides  all  the  necessary  feai' 
.  -tures  to  ensure  proper  collectioh  of  the  data  at  high  speed  and 
.At  a  high  tape  density  (7J./^  d^ta  charactera/lOO  characters- 
punched).    Detailed  wiring  diagrams  of  the  logic  and  punch  cir- 
cuits are  available  from  the  author^-.     "  ■  " 

The  required  instrumental  correction  faptors  are  either 
measured  directly  (e.g.  the  excitation  correction)  or  are 
independently  determined  and  then  njanually  prbcessed  to  give  a 
proper  int^at  file  (e.g.,  the  photorajiJtiplier  re^lpons?).  -Once 
determined,  the  correction  data  becaine  a  part  of  the  computer  ' 
library  until  replaced  or  updated  to  reflect  instrument^g.  change^' 

A  full  description  of  the  calibrati6n  of  the  sys1:em  and  the  . 
determin|.tion  of  the  correction  factor  library  has  been  published, 

13 

"        •         DISCUSSION  ;  .  , 

The  LRL  Computation  Center  is  comtKssed  of  a  number  of  diritkl 
cpmputers,  including  three  CDC  6600 's  and  two  CDC  7600''s         '  / 
The  machines  are  operated  in  a  modified  "time-share"  environraenr^' 
V     n^S^v'^f''  ^""^"^  *°  ^"y-  °^  the -five  machines  f\tvi  hi! 

(Fig   3)         ^  network  system  involving  PDP-6  and  -8  com^ter^/ 

The  existence  of  this  large  digital  computation  facility, 
and  its  relatively  inexpensive  local  access,  eliminated  consider- 
ation of  a  small  dedicated  computer  for.  our  application.    Three  • 
data  transmission  systems  were  considered »    punched  cards,  mag- 
netic tape,  and  paper  tape.'  The- equipment  and  interface  costs 
plus  the  necessary  manual  transportation  to  the  Computation 
.Center  (V   0.25  miles)  eliminated  punched  cards  and  raametic  taw 
as-the  primary  mode  of  data  coi;ieotion.    (It  should  be  noted 
both  cardf^^nd  t^ipe  are  us»''  foir  lonff-tem  slt^rag-^.'i 

In  addition  to  the  Inw  -o-f^  fr^-.  Rqujtjmpnt  ;)nd  the,  1  ntnrfac« 
punched  paper  tape  held  the  great  advintagb  in  a  time-Jhare 
.environment  of  direct  transmission  to  the  central  processor  via 
an  ASR-33  Teletype.    It  was  relativel^r  easy  to  design  ftie- pa'per 
tape  ^interface  So  that  the  data  colleotton  rate  waS  limited  bv' 


1-33 


instrumental  parameters  rather  than  by  the  punch-    Because  our 
data  has  essentially  infinite  life  once  recwded/  (as  opposed  to 
data  used  for  instrument  control,  for  exaDvple)y/the  slow  data 
transmission  rate  of  the  Teletype  is  not^.Jidtnr,    The  code  uas 
written  so  that  once  a  tape  was  start^tT;  the  central  processing 
computer"^ provided  all  necessary  Teletype  start-stop  instructions. 

Our  coding  was  done  on  the  CDC  66OO  using  FORTRAN  IV  language. 
Th6  logic  flow  cha.rt  for  the  processing  program  is  shown  in  Fig.  ^ 
^,    The  retention  of  the  raw,  uncorrected  data  file  (File  *A  in  the 
figure)  allows  immediate  recov(9ry  if  the  user  selects  an  inappro- 
priate opt|on  or  wishes  to  serially  output  data  only  partially 
r^orrected  %  one  of^  the  library  functions.    Because  'the  uncorrected 
data  axe  still  the  primary  results  of  the  experiment,  they  are 
the  only  datft  retained  long-term  on  cards  or  magnetic  tapp, 

There  are  several  1mport.ant  feat^'^^s  of  postprncessins  of 
fluor-^sc^nce  data,    On**  is  the  ability  to  see  the  effects*  of 
earh  on>-rection  f^nption  on  ti^^e  data,    This  can  be  particularly 
important  whe'^  working  n«»ar  t^e  maximum  limits  of  thf>  ins+rumen+ 
¥h'^^  one  coT-Tftotion  factor  ray  n^^tly  dl'^+ort  t^^  data,  For 
edyoatlnpal  purpose*?,  thp  individual  oontrlbution  of  p^ch  func- 
tion can  be  displayed  and  the  overall  iTr^^ortance  of  *he  correction 
technique  den^onstrat^d. 

If  an  output  fortpjit  Is  fixed,  one  will  surely  find  occa5?lors  : 
when  a  different  foraat  Is^the  only  one  that  will  properly  display 
a  result,    Tlsin^:  +he  postprocessing  technique,  we  have  essentiallv 
complete  freedom  An  displaying  the  data;  wavelength  in  nanomet'^i^^r 
or  wavenumbers,  output  in  quanta  or  intensity, 

While  the  general' application  of  the  technique. gf  post- 
processing has  been  previously  discussed  by  Drushel     and  others, 
we  believe  our  specific  use  of  digital  postprocessing  of  fluores- 
cence data  demonstrates  the  general  usefulness  of  the  technique 
to  a  large  nurnber  of  users  In  education,  ^research,  and  industry^ 
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Fxo,  1.  Blod:  diasraxD  of  the  modified  spcctropbototluorometer. 
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LEAST  SQUARES  AND  STATISTICS  JN  CHEMIS^TRY 

iJohn  Overend 
Mptecular,  Spectroscopy  Laboratory 
University  qf  Minnesota,  Minneapolis,  Minn^ 

\-    ■  ■• 

In  all  branches  of  chemYsery  we  frequently  encouater  the 
^problem  of  determining > empirical  values  for  the  parameters'  In  the 
mathematical "liibdel  of  a  physico-chemical,  phenomenon.    Usually  we 
haye  more  experim'ental  data  than',  there  are  parameters  to  be  deter- 
mined  and  we  resort  to  statistical  analysis  based  on  the  principle^ 
of  least  squares.  .  The  basic  idea  is  quite  simple;  suppose  w6  have 
a  linear  relationship  between"  the  explaining  parameters,  x . ,  and  ' 
the.  observables,  y^,  i-€.». 


(1) 


where  the  coefficients,  J^^  ate  determined' by  the  model  ana-Are 
known  in  advance.    The  problem  is  to  determine  the  best  estimates 
of  the  explaining  parameters  such  ihat  tbe  calculated  values  of  the 
observablesj  y^,  match  closely  the -experimental  values,  y^.  The 
criterion  a^plied^ is  that  the* sum  of  the  squares  of  the' errors, 


(2) 


.1 

be  a  minimum.    Differentiating  eq  (2)  partially  with  respect  to 
each       and  setting  the  derivative  equal  to-  zero  we  obtain  the  set 


ERIC 


4^/ 
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of  notmal  equation^  . 

V 

and  the  solution  of  these  gives  the  least  squares  estimates  of  the 
explaining  parameters,      •    The  normal  equations  of  eq  (3)  are 
conveniently  written  in  matrix  notation  as 


J'y    =    X  J  '  (4) 

Here  J  is  a  matrix  of  the  coefficients  J^"'  (cf >  eq  (1)),  y  is  a 
\  ^ 
vector  of  the  observables  and  x  a  vector  of  the  explaining  parame- 

terk*     In  matrix  form  the  least  squares  problem  is  nicely  set  up  to 
be  programmed  for  a  computer. 

If  we  have  relatively  more  confidence  in  some  of  the  exper- 
imental data,  these  can  be  gi^en  greater  weight  in  the  least  squares 
analysis  by  requiring  that  the  "weighted  sum  of  errors  squared'^ 

I  Pi^i  >        ^  minimum.     In  matrix  notation  this  gives  a  set^ 
of  normal  equations  , 

J'  P  y    ='  J'  P  J  X  (5) 

where  P  is  a  diagonal  matrix,  the  elements  of  which  are  the  weights 

associated  with  the  corresponding  observables.    The  least  squares 

estimates  of  the  explaining  parameters  are  given  by 

^  ♦ 

,  X    =     (J'  P  J)"l  r  iP  y.  (6) 

If  the  relationship  between,  the"  observables  and  the  ex- 
plaining parameters  is  nonlinear,  y.  =  f . (x  ) ,  it  is  linearized  by  | 


expansion  in  a  Taylor  series  and  the  J^-'  coefficients  of  eq  (1)  are 
then  the  partial  derivatives  dy^/dXy    In  that  case  the  least 
squares  is  reduced  to  the  linear  problem  except  that,  rather  than 
determine  the  explaining  parameters  directly,  the  least  squares 
procedure  is  used  to  determine  the  corrections  to  be  applied  to  an 
initial  (assumed)  set  of  explaining  paramet;ers,  x^.    We  define 

^y^  as 


Ay,  =  y,  ;  f,(xj0) 


(7) 


and  determine  the  corrections,  x  (to  be  added  tu  x^i,  from  the 
equation  \  ^  ^'^^ 


Ax  =  (J'  P^J)7^J'  P  Ay 


(8) 


which  is  analogous  to  eq  (6) .    Since  the  problem  is  nonlinear  it 

is  necessary  to  repeat^he  least  squares  calculation  i^teratively 

until  the  calculated  corrections  AXj  become  arbitrarily  small.  ^ 

The  least  squares  determination  of 'the  parameters  in  a 

problem  has  an, additional  divjLdead;  we  obtain  also  estimates  of  the 

« 

statistical  dispersions  (the  uncertainties)  and  the  correlation 
coefficients  of  the  parameters.    We  state  without  proof  that  the 
square  matrix  (J'  P  J)"*^,  when  multiplied  by  an  appropriate  coeffi*^ 
cient  c,  gives  an  estimate  of  the  varlance-covariance  matri:t,  E. 
The  diagonal  elements  of  I  are  the  squares  of  the  statistical  dis- 
p^rsions  of  fhe  corresponding  explaintrig  parameters*. 


and  the  offjdi^gonal  elements  of  I  give  the 


correlation  coefficients, 


s 


(10) 


The  estimation  of  €he  coefficient  c,  thrpugh  which  we  obtain 
*  the^variance-covariance  matrix,  is  a  recurrent  problem.    It  is  us.e- 
ful  to  consider  two  cases;,  the  first  is^  that  in  which  the  model  of 
the  chemical  problem  implied  by  eq  (1)  is  relatively  good  and  the 
statistical  errors  originate  in  the  uncertainty  in  the  physical 


measurement  pf  the  observables,  y^.  If  the  random  error  in  each-, 
measurement,  a{y^},  can  be  estimated  from  the  analysis  of  the  ex- 
perimental  conditions, 'c  is  determined  by 

c  =  (1/f)  Z  p^a^{y^}  ^  (11)' 

where  f  is  the  number  of  statistical  degree^  of  freedom,**  i.e. ,  the 
^^nuii\ber  -bf  data  points  less the  number  o^^^x^laining  parameters,  A 
familiar  example  of^^^l^    class  of  situa^tion  is  the  analysis  of  a 
, simple  rotation-Vi^rkTion  band  of  a  diatomic  molecule/    The  wave- 
numbers  of  the  incjiyidual  f-otationa-1  lines  in  "the  band  are  the. 
observables  and  there  is  a  small  random  error  associated  with  the 
measurement  of  each  dv^e  to  instrumental  noise  and  the,  difficulty  of 
pickirtg  out  the  center  of  the  line,    Th^  observed  wavenumbers  of  the 
lines  are  then  fitted  by  least  squares  to  an  equation  of  the  form  . 


m  vib 


+  (B'  +  B")m  +5  (B'  -  B")m2  + 


(12) 


I  i 


and  the  explaining  parameters  v      ,  (B'  +  B")  and  (B'  -  B*^)  are 

vlb 

(12)  is  an  excellentj mbdel  with  sound  quantum 


determiiled,  Eq 


mechanidal  foundations  and  the  only  poprce  of  rantiom  error  in  the 

do  ; 
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derived  parameters  is  that  arising  in  the  experimental  determination 
of  the  wavenumbers  of  the  lines  in  the  spectrum*  / 

The  second  case  we  consider  is  the  estimation  or  the  coeffi- 
cient  c  if  the  observables  are  determined  w^th  relatively  high 
precision  but  the  model  is  marginally  adeq/ate  to  describe  the 
phy-sical  phenomenon,  usually  through  limitations  in  the  theory. 
As  an  example  of  this  situation  consider  the  empt/ical  relationship 
of  ti4  bond  length,  r^,  in  a  diatomic  molecule  to  the  stretching 
force  constant,  k  ,  through  Badger's  rule^  which  may  be  stated  as 


(13) 


where  C^^  and  d^^  are  empirical  parameters  and  are  assumed  to 

depend  onlyon  the  rovf  of^he  periodic  table  in  which  the  atoms  i 

and  j  lie.    In  general  r^  and       may  be  determined  with  negligible 

statistical  error  and  the  origin  of  the  uncertainty  in  the  derived 

parameters  C^^  \ad  d^^  stems  from  the  approximate  na^re  of  Badger's 

rule.    If  we  take  the' experimental  values ^f  r    and  K    for  a  number 

e  e 

of  molecules  which  are  expected  to  have  the  same  values  of  C^^  and 
d^j  we  may  treat  the  r^  as  observaT^les  ^nd  usevle^st  squares  to 


obtain  the 


"1  '■ 

i  best  e 


estimates 'of  the  Badgerls  rule  parameters.  To 


determine  ,tlie'  yarlance-covariance  matrix  for  these  it  se  iifis  reason- 


able  to  use  a  Valde  for^lthe  coefficient!  t  which  retlects 
qualitjy  of  pe  fiti"<of  eq  .(l:^)  to  the  ej(jp4riment-al  dita. 


usual,  ther^forei  to  estimate  c  by 

-      c  ^'s2/f 


It 


e  ovfrall 


(14) 
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vhere  f  is  the  number  of  degrees  of  freedom  and  S"^  is  the  sum  of 
»  ♦     .  * 

the  squares  of  the  differences  between  the  experimental  r^  values 
and  those  calculated  from  the  experimental  values  of       and  the 
least  squares  estimates  of  C.,  and  d... 

.    An  impo.r'tant  ppint  to  note  in  the  least  squares  analysis 
of  experimental'  data  arises  if  there  is  some  intermediate  re- 
duction of  the  data  before  the  final  determination  of  the  ex- 
plaining parameters.    Suppose  we  have,  as  before,  a  set  of  primi- 
tive observables,* y,  and,  a  flual  set  of  explaining  parameters,  x, 
but  that,  rather  than  fit  the  x  directly  to  the  y,  we  first  reduce^ 
these  to  an  intermediate  set  of  parameters,  z,  and  from  these  dete^- 
min/g  the  x.    This  particular  problem  cropped  yp  in  our  own  work^  * 
on  anharmonic  force  constants  from  the  spectroscopic  energies  of 
vibrational  states  which,  in  some  cases  we^'iHrrSt^educed  to  the  an- 
harmonic frequencies,  w    and  the  anharmonic  coefficients,  x 

S  ^  8  S' 

The  data  reduction  scheme  may  be  diagrammed  as  follows: 


yx 


(vibrational  energies) 


(anharmonic  force  constants) 


In  t 


le  direct  adjustment  qf  the  y  to  the  y,  \ie  have 


,  '  P  Ay  =  J'  '  P  Ij  Ax-.  ' 
yx    -y  Ji      ryx    -y  ryx  - 


5, 


'A 


(lb 


1-u 


ty  is  the  diagonal  weight  matrljfe  attached  to  the  primitive 
data,  v..  iNow,  since  by  definition. 


and 


\     Az  «  J  Ax, 


Ay  «  J    J  i  Ax 
-  -yz-zt 


(16) 


and,  tiinfie  ^Iso  by  definition 


Substituting  eq  (16)  and 


J    'J    'P  J 
^zx  -yz  -y-yz 


anB,^  if 


I? -J 


yx 


J    *  Ji  J 

--yx  -yz-z^c 


Ci7j  in  e<i  (1^), 


z  »  J  'P  J    J  Ax 

-zx  -yz  -y-yz-zx  - 


(17> 


(18) 


we  compaife  this  wilth  jthe^^fjyanrt^of  the  normaX  equations  used 


wh'en  Ax  is  computed  directly  from  Az,  i.e.^ 


'P  a'z  W  J  ''P  J  Ax, 
'  --zx  -Z       \    -zx  -z-zx 


we  see  that 


ijd  iso  we 


n|c     t  th^ 


e(ers. 


of  djta 

car^; 


through  the 
>  apart  fron 


coinclule  tha': 
'  whomever 


e^iijiHrse  of  the 


in  practical 


'  J    'P  J  . 

-yz  -y-yz  , 

for  consistency  in  the  ^^tatistical 


we  pass  through  an  int^nnediate  stage,  ^e 

I  •  I 

proper  weight  matrix.    (It  is  ^,interest  ing  to 


trivial  scale  factor,  the  weight  matrii  P 


(2(0 


treat- 


.-z 


riarice-covariance  matrix  for  the  z  jJaram- 
least  squal^es  calculations  in  which  one  ^ 


attempts  to  adjust  a  relatively,  large  number  of  parameters  to  a  few 
.experimental  data,  even  though  there  may  be  more  data  tnan  parame- 

ters,  this  does  not  guarantee  that  all  the  (parameters  ^^e  deter- 

\1      I  I  I 

minable,  -  Such  difficultUet  btecome 'manif esq  when  we  attempt  to 

invert  the  matrix  J'PJ;  j[]{5 there  are  fever  bata  than  paiameter^s  thia 

J    '  \ 

jljinverse  does  not  lexis  t.    Even  if  ther6 


matrix  ia  singular  and  \th 
*are  mpre  Idata  than  paratte 
conditioned  and,  the  resik|ts 
in  the  nonlinear  least  aqi 
to  s^upply  .sonje  constraint  tb 
is  how  to  accomplish  this  in 

■    •  ■  1 

which  we  have  experimented 


singular  its  rank  may  |be * det ermined  conveniently  by  diagonalizing 

,        !  ■  .  •  1  ^  ' 

and  finding  the  number  of.np 

but  ill  cond£ti|oneid  it  will  not  have  zero  .jelgenvalues  but  some  wil 
i>e-var[^  sijiall/J  We  may  rej^ard.the  diagonalization  of  tie  normal- 


rs,  the  ;m'atrix  may  be  extremely  ill* 
bf  inversion  unreasonable  -  especially 

I        \\  '  -J  , 

es.  pr3blem»    Clea^rly  there  is  then, need, 
stabilize  the  problem  s^nd.  the  question 
the  most  effective  way.    An  ide^  with 
s  the  f ollowintt^  if  the  matrix  J'PJ  is 


zero  eigenvalues.    If  it  is  nonsingul^r 


equation  matrix  as 


of  the  problem 


given  l^y 


the  trans flormat ion  to  ^  new  set  of  ilncorrelated 

the  ori^im  1  para'meterji 
icierlts  of  the  l|lielar  cOmbi|iat  lonj  being  . 


parameters  which  are  lii;i;fear  combinations  'p^f 

I      1  if 

thle  coe 


the  elements  o 


the  dprresponding 


nal|Lzec  matrix  J'PJ  is  eAS'ily  inverted  by 
of  kacK  element  ani  give^  :he  \ 


dispersions  and  indicate  which 
explaining  parameters  is  ill-'de 


eigenvecitjpr 


simply  Cak 


aris  nce-covjariancie  mat  rix,  of  the  neir 


uncorrelated  parameters*    'fhe  smajlii  eigehv^lues  correspond  to  iatge 


The  diago 


ng  th(e,  invers 


line'^r  combiLhation  of  the  original 
termined.    The  useful  constraints  to 
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apply  to  the  problem  are  then  identified  as  ones  which  have  a  -large 
Projection  on  the  eigenvector  of  the  ill-determined  uncorreJLated  * 
parameter  . 

The  4ise  of  statistical  techniques  in  the  analysis  of  the  ^ 
data  obtained  in  chemical  experiments  has  grown  explosively  since 
computers  became  readily  available  and  it  might  at  this  time  be 
appropriate  to  r'emember  that  these  techniques  deal  only  with  random 
errors.    They  are  inappropriate  for  handling  systematic  errors, 
mistakes  and  de^f iciencies  in  the  model  used  to  interpret  the    "  / 
results  atid  it  is  particular Ijr  important  to  ^uard  against  these 
possibilities  -  it  is  so  easy  to  bury  them  in  the  computer  analysis 
of  the  data.  •  It  is  important,  therefore,  to  examine  the  final  fit 
of  the  model  to  the  original  data,  preferably  by  plotting  a  graph 
pr  alternatively,  by  examining  carefully  the  differences  between    /  ,  ^» 
the  observed  and  calculated  values  of  the  observables  apd  checking 

that  these  fall  in  a  reasonably  random  pattern  with  no 'systematic 

« 

trefnds.    The  computer  has  made  data  analysis  systematic  and  simple  .  , 

iff 

*but  it  is-  no  substitute  for  a  careful  examination  of  the  physics 
ft' 

and  chemistry  of  the  problem.  ' 


1.  R."  M.  Badger.  J.  Chem.  Phys.  3,  7-10  (1935). 

2.  I,  Sukuki,  M.  A.  Pariseau  and  J.  Overend,  J.  Chem.  Phys.  44^, 
3561  C1966).  I 
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1  « 

The  Paman  effect  is  charactBriffid  bv  poor  signal  to  noise  ratios. 
In  addition,  sPectra  often  apnear  as  weak  sij^nals  on  high  backgrounds 
Because  of  these  problems,  some  time  ago  we  began  to  consider  auto- 
mation of  our  ^aman  instrument  for  digital  data  acquisition,  signal 
aver.aging,  and  computer  assisted  data  reduction.    This  program  has 
.now  made  considerable  progress. 

Most  instrument  automation  systems  feature  on-line  data 
acquisition  using  a  variety  of  computers.    Scherer  (1)  has  con- 
structed such  an  automation  system  for  a  Raman  instrument  using  an 
IBM  1800  computer,  and  Bernstein  et  al«  (2)  have  used  a  PDP-§  computer 
in  an  on-line  system.    For  several  reasons,  we  have  chosen  to  use 
off-line  automation  of  our  Raman  spectrometer,  in  which  a  hard-wired 
system  controls  the  spectrometer,  executing  commands  read  sequen- 
tially from  paper  tape,  with  the  data  being  punched  out  by  teletype 
on  to  paper  tape.    Data  reduction  is  done  subsequently  u^ing  a  Nova 
computer  in  the  laboratory. 

The  choice  of  an  off-li"he  svstiem  was  made  based  on  the  following 
^  considerations!  1)  The  ^aman  spectrometer  is  a  verv  stable  system. 
\nur  laser  power  fluctuate*^ verv  little  even  over  the  course  of  a  16 
r\^r  run,  and  there  is  j?encrsilv  no  svstem  deterioration  over  long 
time  Periods  unless  it  involves  sample  xlegradation.    2).  There  is 
little  decision  making  to  be  done  during  the  course  of  a  Raman 
.spectrum.    Thft  operator  can  set  all  instrument  parameters ^d 
need  not  be  changed  over  long,  time  periods.    Thus  there  seemet^^ 
He  little  need  to  provide  the  cjecis ion-making  power  of  a  computei 
3)  ^oT  those  spectra  where  digital  data  acquisition  and  signal 
averaging  could  provide  definite  improvements  over  analog  methods, 
rather  long  times  are  required.    These  range  from  ca.  4 *to  15  hours. 
Thus  the^time  needed  to  prHnt  out  the  tape,  and  the  time  needed  to 
load  the  output  tape  into  a  computer  after  'the  run,  is  verv  small 
compared,  to  the  time  needed  to  acquire  data.  4)  It  was  felt  desirable 
to  carrv.  out;^  as  much  of  the  data  reduction  as  possible  within  our 
own  laboratory,  rather  than  reiving  on  outside  computer  facilities.' 
With  the  off-^line  svstem  and  Nova  computer,  almost  no  danger  exists 
of  losing  the  data  due  to  computer  failure.    Further,  the  svstem, 
once  written,  can  fun  unmodified  if  this  is  desired,  for  several 
years.    Our  experience  with  computer  centers  has  been  that  large 


1-4.8 


periods  of  time  are  spent  in  minor  revision  of  programs  due 
to  ma1or  system  changes.    5)  Since  the  computer  is  not  needed 
during  the  experiment,  it  can  do  other  things.    In  our  case, 
the  computer  is  used  for  control  and  data  acquisition  in  a 
Fourier  transform  infrared  spectrometer.    This  arrangement  has 
the  added  advantage  that  Raman  and  infrared  spectra  are  plotted 
using  the  same  computer  and  plotter. 

The  discussion  of  the  Raman  automation  system  is  divided 
into  three  parts:  Control  of  the  spectrometer,  nature  pf  the 
digital  output,  and  data  reduction. 

Instrument  Control;    Figure  I  shows  the  functions  of  the 
controller  built  for  us  bv  the  Digital  Automation  Co.  of 
Trenton,  New  Jersey .    Signals  go  from  the  control  box  to  the 
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Fig.  1.    Schematic  control  of  spectrometer 


Spex  Ind.  spectromete^r*^  The.  essential  features'wlth  which  - 
the  spectrometer  must  be  equipped  Include  a  stepping  motor  and 
photon  counting  amplification.    The  motor  makes  a  large  number 
of  steps  per  wavenumber  Increment,  usually  72  steps/wavenumber, 
though  bv  use  of  a  gear  box  this  can  be  Increased  to  72  steps/. 001 
wavenumber  If  this  Is"  desired.    In  addition' to  controlling  the 
position  of  the  spectrometer,  ^resetting  time  and  counts  for  a 
data  collection  cvcle,  and  continuously  counting  and  scanning,  t2^e 
controller  will  also  rotate  the  polarization  analyzer  and  half 
wkve  Dlate  for  automatic  recording  of  depolarization  ratios. 
>  \     ,    ^  ' 

Commands  are  given  to  the  spectrometer,  through  the  controller, 
using  a  paper  tape  reader  which  'is  on  the  ftont  of  the  control  boxi 
Any  number 'of  commands  can  be  listed  for  sequential  execution.  The 
commands  Include:  Posltloh-set  the  spectrometer  to  a  particular 
frequency:  Increment-  step  through  a  spectrum,  taking  data  for  a 
preset  length  of  time  or  to  a  preset  count  limit  at  stated  Intervals 
Fixed-  count  for  presfet  time  or  prjfeset  counts  at  a  fixed  frequency: 
Continuous- 'scan  to  a  given  terminal  frequency,  accumulating  all 
counts  in  ^he. region;  Actuate-  close  any  combination  of  three  re- 
lays to  rotate  analyzer  and/or  half  wave  plate,  and  to  turn  off 
the  laser  at  the  end  of  the  experiment;  End-  terminate  data  col- 
lectioi^    The  format  of  the  commands  is  extremely  simple,  ^ 
typtcal  command,  for  incrementally  scanning  from  the  current  pos- 
ition, in' the  positive  Misdirection,  to  a  terminal  frequency  of 
200  cm~l,  in  1  cm-1  increments.,  with  a  preset  couilt  limit  of  9  x 
10^  counts,  is  shown  below.' 

110200+00194. 

For  convenience,  the  exciting  line  is  always  set  as  10000  on  the 
wavenumber  counter,  thus  avoiding  the  problem  of  negative  values  on 
the  anti-Stokes  side  of  the  spectrum.    The  time  is  preset  using  a 
front  panel  switch.  . 

Data  Output:  Ff^re  2  shows  the .  functi^ons  of  the  controller  in 
data'o\itput.    The  controller  contains  its  own  scaler  and  timer. 
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Fig*  ,2.    Schematic  output  from  spectrometer 


which  take  data  directly  from  the  pulse  height  analyzer.  There 
is  not  time  ponstant  circuitry,  so  raw  data  are  obtained.    An  up- 
down  counter  counts  the  motor  pulses  for  incrementing  th,e  wave-- 
number  counter.    This  has  been  found  to  be  completely  satisfactory, 
shaft  encoder  has  been  used.  J 


Th\^form^lt  of  tihe  output  tape  is  shown  below.    It  contains  the 
counts /TElme,  and  \frequency  shift,  as  well  as  relay  closures,  f#r 
each  data  point.    Although  the  first 'three  pieces  of  information 
are  all  puhched  as  six  digit  numbers,  this  is  really  only  necessary 
for  the  counts.    Thk  tape  is  formatted  for  easy  manipulation  by 
the  computer.    Two  space  characters  are  punched  between  each  piece 
of  information  on  a  line,  and  carriage  return/line  feed  characters 
separate  lines.    The  tape  is  punched  completely  in  ASCII  code. 

*    123A56  123^56       123A56  1  CR,Lf 

counts  time        '  /^V  relays 

Data  Tape . 

Data  Reduction:    Once  data  acquisition* is  complete. the  tape  is 
transferred  to  the  Digilab  FTS  system  with  its  Nova  computer  for  data 
deduction.    Although  the  ASCII  coded  tapes  are.  quite  long,  they  can 
be  read  into  the  computer  in  a  short  time  using  a.  high  speed  paper 
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tape  reader..  Even  If  this  Is  not  available,  ohlv  about 
twenty  minutes  Is  required  via  the  teletype.    This  Is  not  too 
severe  a  limitation,  as  the  time  needed  to  acquire  the  data  Is  so  • 
long.  '  ' 

The  computer  as  presently  set  up  has  J2K  words  of  core  memory, 
however  we  plan  to  convert  to  8K  of  core  memory  and  a  i28K.dlsc 
memory  In  the  near  future.    This  will  greatly  expand  ^the  pos- 
sibilities for  refinement  of  the  data. 

The  computer  Immediately  divides  the  counts  by  the  time  for  each 
data  point,  converting  to  a  counts/second  format.    Before  plotting 
the  data  on  the  Houston  Instruments ^DP-1 ' plotter^  the  computer 
enters  a  conversational  mo^de.    The  number  of  data  points,  and  their 
maximum  and  minimum  Intensity  values,  are  printed  on  the  teletype. 
The  operator  then  Is  given  the  current  values  of  the  x  and  y  scales 
for  plotting,  and  can  modify  these  via  ,the  teletype,  based  on  the 
above  Information.    Because  spectra  often  h«ve  high  backgrounds, 
the  operator  can  ask  to  have  the  data  offset  to  zero,  and  auto- 
matic scale  expansion  *  can  be  selected.    The  option  of  plotting 
points  rather  than  a  continuous  line  Is  also  available.  Finally, 
a  binary  tape  of  the  data  can  be  punched.    .This  Is  much  shorter 
than  the  original  ASCII  t'ape,  and  Is  generally  the  form  In  which 
spectra  are  stored. 

After  decisions -as  to  plot  format  hava  been- made,  the  spiactrtmi 
Is  plotted.    The  data  as  plotted  at  this  point  are  raw  data,  having 
no  smoothing,  either  digital  or  analog.    They  often  appear  .to  be 
quite  noisy  for  this  reason.       •     ■  "  . 

.  "^^urther  data  reduction  options  are  al$o- available.    These  in- 
clude smoothing  of  data  uslhg  either  a  moving  average  or  parabola 
fit,  band  area  determinatida  for  depplarization  ratios,  Computation 
of  the  depolarization  ratio  and  the  uncertainty ,  in  tjiat  ratla  using 
counting  statistics  and  error  theory,*  correction  of  Intertsitles  for 
radiometric  and  scattering  phenomena,'  and  so-called     8  plotting. 
This  latter  technique,  developed  by  Scherer  (3),  is  quite  useful 
for  separating  totally  symmetric  modes  from  the  spectnin if  more 
than  one  symmetry  species  is  Raman  allowed.    Frequency  calibration 
is  done  using  laser  emission  lines.    If,  as  often  occux:3  in  Raman* 
spectra,  bands  are  present  on  a  sloping  background,  the  operator' 
may  have  a  particular  region  replotted  with  a-  sloping  background 
subtraction  to  compensate  for  this.    The  scaling  of  t^e.  spectra  ^ 
is  such  that  infrared  spectra  from  the  FTS  svstem  may  be  plotted 
directly  on  the  Raman  spectra. 

figure  3  shows  the  spectrqm  of  CCI4  in  the  459  cm"^  region.  : 


60 


Pig.  3.    X-'.     of  ccr. 


Fig.  6.    Raman  spectrum  of  p-azoxyanisole 
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This  region  was  digitized  at  0.1  cvT^  intervals  for  a  time  of 
1  second  at  each  point.    A  lower  resolution  spectrum  of  CCIa 
is  shown  l^n  Figure  4,  covering  the  region  from  0-400  cm"!. 
In  Figure  5  this  s^. spectrum  is  plotted  using  the  point  plotting 
mode.    Figure  6  shows  a  region  of  the  spectrum  of  p-azoxy- 
anisole,  f r«n  w: //f^m-l  .    in  Figure  7  this  same  spectrum  is  re- 
plotted  using  background  subtraction  and  automatic. scale  expansion. 


Fig.  7.    Data  frotn  Fig.  6,replotted  with  background 
subtraction  and. scale  expansion. 
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This  system  has  been  in  operation  for  about  8  months, 
although  certain  data  reduction  features  have  been  broyght  up 
more  recently.    Others  are  still  in  progress,  such  as  slit 
function  correction,  line  shape  determinations,  and  least  squares 
resolution  of  overlapping  bands.    During  the  period  of  operation, 
it  has  enabled  us  to'  acquire  spectra  which  could  not  be  obtained 
using  analog  techniques.    These  are  cases  in  which  extremely  high 
bacHgrounds  prevent  data  fropi  being  recorded  in  the  analog  system.  - 
Such  instances  are  frequent  in  Raman  spectra.    The  system  ha's- 
also  opened  certain  research  areas  to  us,  particularly  the  study 
of  Raman  line  shapes  and  errors  in  depolarization  ratio  measure- 
ments, which  were  not'  previously  feasible  using  analog  techniques,  y 
Finally,  it  has  permitted  to  see  our' true  experimental  data  for 
the  first  time,  unmodified  bv  analog  ratemeter  and  recorder. 
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ABSTRACT 

A  data  acquisition  and  equipment  control  system  is  "being 
developed  and  used  in  the  high  temperature  ch^emical  laboratory  at 
the  University  of  Kansas.    The  central  processing  unit  of  the 
system  is  an  8K,  l6-"bit  per  word,  general  purpose,  Hewlett-Packard 
2116B  digital  coii^)uter. 

The  design  specifications  are  given  for  the  hardware  of  the 
system  and  for  the  software  for  one  of  the  experiments,  ,the 
collection  of  ionization  efficiency  cuiwes  with  a  Nuclide  high 
temperature  mass  spectrometer.    The  hardware  configuration 
adopted  and  the  software  developed  for  the  dat^  collection  are 
described.    Examples  of  data  collected  for  H^O    and  ^^3^13  are 

shown*    The  advantages  of  automatic  data  acquisition  over 
previous  methods  of  manual  collection  are  that  (l)  the  data  have 
improved  precision  and  reliability,  .(2)  the  procedure  may  he 
exactly  repeated,  (3)  "the  data  collected  are  permanently  stored 
in  a  very  conveniently  retrievable  form,  and  (U)  further  data 
processing  is  readily  initiated  and  accomplished  in  a  reliable 
fashion. 


INTRODUCTION 


The  high  temperature  chemical  research  program  at  the 
University  of  Kansas  currently  includes  studying  and  character- 
izing vaporization  reactions  under  equilibrium  conditions, 
studying  the  kinetics  of  vaporization  of  structurally  simple 
materials,  and  identifying  solid  phases  by  x-ray  dif fractometry . 
The  principal  apparatuses  being  used  in  the  research  are  a 
Nuclide  high  temperature  mass  spectrometer,  a  quadrupole  mass 
spectrometer  manufactured  by  Electronics  Associated,  Inc.,  an 
Ainsworth  semimicro  recording  balanqe-torsion  effusion  apparatus, 
and  a  Philips  powder  x-ray  diffl-actometer . 
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In  1969 J  with  a  grant  from  the  National  Science  Foundation, 
two  identical  Hewlett-Packard  2116B  general  purpose  digital 
computers  were  purchased,  one  for  the  high  temperature  laboratory, 
the  other  for  the  microwave  laboratory  at  the  University  of  ^ 
Kansas.    The  computers  served  as  central  processing  units  to  data 
^acquisition  and  equipment  control  systems  currently  in  use. 

The  purpose  of  the  work  reported  here  was  to  interface  the 
Nuclide  mass  spectrometer  to  the  data  acquisition  and  .equipment 
control  system  for  the  purpose  of  nollecting  h:Lgh  temperature 
.^ss  spectrometric  ionization  efficiency  curves  on  vaporizing 
^ecies.    Presented  elsewhere  in  ^his  symppsiiim  are  papers  which 
ieport  work  performed  on  the  computer  control,  of  the^ quadrupole. 
Wss  spectrometer  i«.,th,e'*iiigh  teJipeVatut'e  laboratbry .  ^)  -  and  the"* 
development  of  th^  sifter  syst^rii'l^n  the^  microwave  laboj^atqry '(2)\ 


■fHE  USE  OF  THE  MASS  SPECTROMETER 

The  use,  ^of  the  Nuclide  high  temperature  mass' spectroneter  ^ 
is  grossly  different  from  that  ordinarily  Encountered  in  organic 
mass  spectrometry  because  gaseous  f)ressures  are  so  low  and    »  . 
because  temperature  dependences  of  ion  intensities  are  often 
sought.  ',The  mass  spectrometer  i's  used  ta  investigate  chemical 
phenomena' at  high  temperature.    The  rate  of  mass  scan,  for  .example 
is  usually  on.the'order  of  one  a.m.u.  per  5  sec.  at  lOQ'^.m.u.  - 
ThQ  instrument  is  usually  operated  at  a  resolving  power  of 
300-500.    Many  ot*.  the  substances  being  investigated  with  the  i^jass^ 
spectrometer  have  low' volatility .    For' example,  at  2000^K  th^ 
vapor  "pressure  of  T!%0^^  in  the  Knudsen  cell  situated  several 

inches  from  th^  ion  source,  is  of  the  order  of  10  atm. 

The  single- focussing  mass  spectrometer  has  a  60-degree  sector, 
12-inch  radius  magnet,  an  electron  -impact  ion  source,  and  bpth 
Faraday  cup  and  electron  multiplier  (letectors-'which  are  used  in  a 
mutually  ex-clusive  tashion. 

A  Knudsen  cell  containing  a  -solid  sample  is  intrdducdd  into 
a  high  vacuum  system  and  is  heated  by  radiation  and  electron 
bombeirdment .    The  temperature  of  the  cell  is  measured  by  optical 
methods  -either  witl)  a  manually  of)erated  Leeds  Northrup 
disappearin]g  filament  optical  pyrometer  or  with^a'Leeds  &  Northrup 
No.  86^2  automatic  pyrometer.    Thte  gaseou-s  species  effu&e  out-  of 
the  cell  in. a  molecular  beam 'Which  encounters\a  moveable  0.025-''inch' 
slit  that  when  at  a  center  position  allows  a  ^)ortion  of  the  beam 
to  reach  th6  ion  soulrce  region.    The  neutral  beam,  after  passing 
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through  various  discriminating  and  focussing  plates,  is  "bathed  . ^ 
with  electrons  having  energies  to  within  ±0»3  eV  of  a  particular 
energy  in  the  range  2-70  eV. 

The  ions  formed  "by  electron  impact  are  accelerated  through 
a  potential  of  ahout  50OO  V,  are  focus sed  through  another  series 
of  plates,  and  then  enter  the  magnetic  field  which  allows  a  small 
ri"bl)on  of  the  ion  "beam  to  "be  focussed  onto  a  detector*  The 
current  due  to  ions,  termed  the-  ion  current,  arriving  at  the 
electron  multiplier  deteclor,  is  multiplied  through  sixteen 
dynodes  having  a  total  gain  of  ahout  10^  and  measured-  "by  a  Gary 
vibrating  reed  electrometer  (VEE).    The  input  to  the  electrometer 
is  in  the  range  of  0*001  to  30  volts,  and  the  output  is  presented 
to  a  10  mV  strip  chart  recorder •    Ion  detection  is  also  performed 
"by  amplifying  the  pulse  from  the  last  dynode  of  the  multiplier  and 
counting  pulses  with  a  Hewlett-Packard  5201L  sealer • 

The  dependence  of  the  ion  current  on  ionizing  electron  energy 
constitutes  ionization  efficiency  data,  from  which  a  chemist  can 
o"btain  valuahle  information:    l)  The  appearance  potential,  that 
is,  the  minimum  electron  energy  at  which  the  ion  is  detected,  can  ^ 
he  obtained •    2)  Identification  of  different  ions  having  the  same 
integral  mass  is  aided  "by  the  value  of  the  appearance  potential. 
3)  Dissociation  energies  and  "bond  energies  for  molecules  can  he 
obtained,    h)  Fragmentation  of  a  molecule  caused  "by  the  impact  oi 
the  ionizing  electron  with  the  molecule  may  in  some  cases  "be 
recogni2;ed  "by  an  aneO-ysis  of  the  shape  of  the  curve. 

^     Some  of  this  information  can  "be  o"btained  from  ionization 
efficiency  curves  which  have  "been  collected  under  earlier  manual 
operation,  the  procedure^for  which  was  as  follows.    The  electron 
energy  control  potentiometer  was  set  at  TO  eV.  A  particular  ionic 
species  was  focussed  *by* manual  variation  of  accelerating  poten- 
tial, magnetic  field,  and  ion  source  focussing  plates.    For  this' 
measurement  the  potentials  on  the  repeller  plates  were  zero. 
Maximum  don  current,  as  revealed  "by  visual  inspection  of  the  VEE 
output  on  a  meter  or  on  the  strip  chart  recorder,  was  taken  to 
indicate  that  focussing  had  "been  achieved.    The  electron  energy 
was  then  manually  set  at  zero.    By  careful  and  very  tedious 
manual  operation  the  electron  energy  was  varied  approxj^mately  at 
0.25  ^per  second  and  the  ion  current  traced  on  a  strip  chart 
recorder.    Although  this  method  was  satisfactory  for  some  ^ 
purposes ,  it  had  limitations .  "  Consider ,  for  example ,  the 
po^si"ble  pro"blem  of  fragmentation  in  the  Ti-0  system. 


1-59 


Gilles  and  Hampson  (3)  established  that  the  c6ngruent  vapor- 
ization of  Ti^O^"  occxxrs  predominantly  according  to  reaction  (l), 

Ti^O^  (s)  =  TiO  (g)  +  2  TiOg  (g).  (1) 

In  characterizing  this  vaporization  reaction  and  calculating 
thermodynamic  quantities  for  the  Ti-0  system  from  mass  spectro- 
metric  measurements  i^  is  imperative  that  the  ion  currfent  oTjserved' 
for  lifeth  TiOg  and  TiO    "be  proportional  to  their  respective  equilit 

rium     partial  pressures*    If  fragmentation  occurs,  hovever, 
according  to  reaction  (2),    ^  -  ' 

TiOg  +  e"  =  TiO"**  +  0  +  2  e",  -  (2) 

the  TiO    ion  current  observed  is  not- proportional)^ to  P^^^  in  t^ie^ 

Knudsen  cell,  "but -depends  as  veil  ori  V      \    Consequently,  . 

2 

results  derived  from  such  measurements  are  not  thermodynamically  ' 
correct*    In  order  to  ottain  reliable  results,  either  the  extent 
of  fragmentation  must  "be  known  for  a  pcurticulfiu:  ionizing  electroil 
energy,  or  the  data  must  Tie  obtained  at  electron  energies  "belov 
that  energy  required  for  fragmentation.    Manually  collected  dat^ 
have  teen  too  imprecise  for  the  scientist  to  "be  atle  to  detect  the 
onset  of  fragmentation  to  produce  TiO    from  TiOglg)  in  the 

presence  of  TiO"^  arising  from  TiO(g).    Automatic  data  acquisition 
vas  undertaken  to-improve  the  precision  of  the  measvirenlents  arid 
thus  to  allow  the  experiment  to  "be  performed  in  such  a  way  as 
either  to  avoid  fragmentation  or  to  correct  for  it* 


SYSTEM  DESIGN  AND  IMPLEMENTATION 


Six  general  aspects  were  considered  in.  .designing  the  data 
acquisition  and  equipment  control  system  (DAECS)  to  perform 
automated  experiments,  one  of  which  was  to  "be  the  collection  of 
ionization  efficiency  data  with  the  Nuclide  mass  spectrometer. 

1*    The  Central  Processing  Unit>    The  choices  for  a  central 
'processing  unit  in  a  data  acquisition  system  include  a  solid 
state  sequencer  which  can  provide  control  signals  to  various 
hardware  devices;  a  large  digital  computer  providing  time-shared 
interaction  through  a  remote  terminal;  and  a  small,  dedicated, 
general  purpose,  digital  computer  providing  real-time  interaction 
viLh  laboratory  hardware. 
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For  the  automation  of  data  acquisition  in  the  high  temperatiare 
chemical  laboratory  the  central  processing  unit  requirements  are 
the  following •    The  unit  is  to  "be  dedicated  "because  the  rate  of 
data  collection  is  sometim"es  100  per  second,  emd  prompt  service 
is  required •    The  acquisition* of  data  and  the  control  of  hardware 
.devices  are  to  "be  flexible  •    Operator  ^interaction  with  the 
central  processing  unit  is  to  be* easy  and  flexible*    The  acquisi- 
tion   emd  control  functions  performed  d\iring  em  experiment  eire  to 
be  performed  simultaneously  by  the  central  processor •  The 
servicing  of  hetrdware  devices  by  the  sys^tem  is  to  be  on  a  priority 
basis  so  that  integrity  of  data  is  maintained* 

2>    Acquisition  of  Data>    The  data  to  be  acquired  from  the 
•mass  spectrometer  are  the  following:    a  10-60  mV  analog  signal 
from  the  pyrometer,  millivolt  analog  signals  from  voltages  on 
various  focussing  plates  in  the  ion  source  region,  a  millivolt 
analog  silgnal  tiharacteristic  of  the  ionizing'  electron  energy,  a 
0-10 V  analog  signal  from  the»  gaussmeter ,  a  0-10  m^analog  signal 
frop  the  output  of' the  vibrating  reed  electrometer,  a  digital 
signal  from  the  olitput  of  the  preamplifier-amplifler-scaler.  pulse 
counting  chain,  and  a  digital  signal  from  a  time-of-day  clock. 

3r    Equipment  Control,    Tiie  control  signals  for  the  liiass 
spectrometer  equipment  are  of  two  typ^s:    l)  feedback  control 
based  on  data  acquired  from  the  experiment,  and       single-ended  \ 
control  for  .setting  or  changing  an  experimental  variable.  Other 
needed  control  signals  are  switch  closures  by  relays  and  propor- 
tional voltages  from  a  ±50  V  di'gital-i;o-ejialog  converter. 

U>    Operator  Interaction.-  The  possibility  for  intervention 
*by  the  scientist  during  an  exjferiment  is  desirable.  The 
scientist  is  to  be  made  aware  of  the  status  of  the  experiment  by 
means  of  typed  output ,. oscilloscope  display,  or  X-Y  or  strip 
cheurb  recorder  displays.'  During  the  experiment  the  scientist 
should  be  able,  for  example,  to  revise  initial  parameters  and 
direct  experimental  status  changes. 

5.    Data  Storage  and  Retrieval.    The  data  storage  and  data 
retrieval  are  to  be  possible  (a)'by  using  the  computer  ^core 
memory,  (b)  by  transmitting  data  to  an  on-line  storage  device, 
and/or  (c)  by  transmitting  tl:^e  data  to  the  sister  system  In  the 
microwave  laboratory  or  to  the  GE-Honeywell  635  computer  for 
storage.         '  '  . 

^6.    Data  Processing.    Data  processing  is  to  occur  either 
simultaneously  with  data  acquisition  and  equipment  control  or  at 
El  later  time  using  data  stored  previously  on  a  data  storage  device 
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such  as  magnetic  tape  or  disc.    Data  processing  of  results  "by  the 
on-line  machine  is  to  "be  kept  to  a  minimum  and  Is*  only  to  he  . 
performed  if  necessary  to  the  five  aspects  .considered  above.' 
Connections  hetyeen-the  local  computer  and  tt^g  .200  K,  36-hit-per- 
word,  lus  cycle  time  University  computet  are  to  ^be  mad^  to  facili- 
tate transmission  of  stored  data  for  processing  hy.  the  larger  . 
machine .        *  f 

The  specifications  for  the  computer  program  tg  c6llect  ioni- 
zatipn  efficiency  curves  under  computer  control  are  summarized  as 
f  oll(5ws .  '  ,  ' 

^  X.    The  ion  intensity  is  to  he  obtained  hy  pulse  counting. 
2.    The  electron  energy  is  to  he  varied  hy  means  of  a  , 

stepping  motor. 
'3-    The  operator  is  manusLlly  to  set  the  original  electron 
e;aergy  and  initialize  several  other  parameters  at 
execution  time. 

U.    Multiple  ionization  efficiency  collections  (passes)  are 
to  he  possible.    A  single  cJollection  is  defined  as  a 
series  of  incrementing  motor  steps  from  the  initial  to 
the  final  electron  energy  setting. 

'5-    The  average  of  multipl'e  collections  is  to  he  calculated 
immediately  after  the  data  from  each  pass  are  accumulated 

6.    Rejection  of  a  pass -or  portion  of  a  pass  hy  the  operator 
is  to  he  possible. 

T,.,    All  dataware  to^.be  stored  on  a  mass-storage  device. 

8.  {   Tfie  operator  is  to  be  able  to  interact  vith  the  data 

\  acquisit'ion  system  to  halt  collection  permanently  or 
\temporardly,  to  reject  b,  pass,  and  to  obtain  a  listing  of 
the  data  as  collected.  > 

9.  A  real-time  oscilloscope  display  of  the  signal  average 
over  all  passes  for  a 'particular  species  is  to  be 
presented  to' the  operator.  / 

The  hardware  system  satisfying  the  general  criteria  described 
above  is  shown  in  Figure  1  in  which  hardware  4evice^  are  repre- 
sented as  rectangles.    The  central  processing  unit  is  the  HP  2116B 
computer,  a  general  purpose  digital  computer.    It  is'  a  l6-bit-per- 
word  computer  with  8,192  words  ot  core  memory  and  a  memory  cycle 
time  of  1.6  microseconds.    The  computer  has  a  flexible  input/ 
output  (l/O)  system  for  'connecting  and  servicing  peripheral 
devices  and  allows  various  DAECS  operations  to  be  performed' 
sj^multaneously.    Thus  the  computer  can. overlap  its  acquisition, 
processing,  storage,  displaying,  eind  controlling  functions.  The 
dedicated  use  and  speed  of  the  computer  allow  the  development  of 
new  experimental  techniques  which  require  time  measurements  in 
microseconds. 
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In  Figure  1  data  acquisition  devices  are  all  represented  to 
the  left  of  the  CFD  HP  2116B  computer  •  -  The  arrov-headed  lineg 
connecting  these  devices  to  the  computer  show  the  direction  of 
data  flow.    To  the  right  side  of  the  HP*  2116B  in  Figure  1  are 
represented  equipment  control  signals Three  control  signals 
are  also  operator  interaction  devices,  namely,  ,the  oscillo- 
scope, the  X-Y  recorder  and  the  §trip' chart  recorder •  The' 
operator  interaction  devices  are  represented  at  the  "bottom  of  the 
dia^am  and  the  storage  and  retrieval  devices  at  the  top. 

.    The  ionization  efficiency  data  ,are  collected  "by  the  electron 
multiplier-pr.eamplifier  (ORTEC  109A')-amplif ier  (ORTEC  l+35)-.scaler 
(HP  5201L)  pulse  counting  chain/  The  computer  controls  the 
opening  of  the  scaler  gate*  ' 

Variation  of  the  electron  energy  is  achieved  "by  a  double- 
ended  stepping  motor  on  the  elect roh  energy  control  potentiometer 
shaft  •    The  motor  has  200  steps/revolution  with  a  minimum  of 
h  msec  d^lay  between  steps*    The  increment  in  each  step  is  0.05 
eV*    The  computer  controls  the  number  ai;d  the  rotational 
direction  of  the  steps.  /  ^ 

Initialization  parameters  are  entered  via  the. keyboard  of  a 
teletype.    All  parameters  and  data  are  stored  on  aiHP  2870A  disc 
unit  having  one  million  words  of  storage  and  average  access  time 
of  80  msec  The  initialization  parameters  include  (a)  the  name  of 
the  storage  file  on  the  disc,  (b)  the  experiment  title,  '(c)  the 
bottom  and  top  limits,  in  cotxnt^  per  second,  of  the  scope  display, 
(d)  the  total  number  of  motor  steps  per  pass,  (e)  the  numbet  of 
motor  steps  to  be  moved  before  initiating  a  scaler  accumulation, 
and  (f )  the  number  of  seconds  the  scaler  gate* is  to  remain  open. 

,The  oscilloscope  display  is  obtained  through  a  dual  8-bit 
0-lOV,  D/A  converter  with  a*  25  msec  refresh  period  and  a  Hgwlett- 
Packard  ll+OB  oscilloscope.    Operator  interaction  is  possible 
by  means  of  the  teletype  and  ,by  means  of  l6  toggle  switches, 
which'  comprise  a  ont  word  register  located  on  the  front  panel  of 

.the  HP  2116B.    The  options  available  via  five  of  the  sixteen 
toggle  switches  are:    to  suspend  both,  data  collection  and  motor 

^ control ^functions  during  a  pass;  to  suspend  functions  at  the  end 
of  a  pa'ss  thus  enabling  the  scientist  manually  to  refocus  the  ion 
beam  before  the  next  pass  is  started;  1>o  list  on  the  teletype  the 
data  collected  for  the  cui-rent  pass;  and  to  terminate  d$ita  acqui- 
siticm  and  equipment* control,  returning  control  to  the  system 
executive  progrfim.^  ^  >   •  ' 
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RESULTS 


The  data  acquisition  and  eq'uipment  control  ay^tem  wq,s  used 
to  collect  the  Ionization  efficiency  curve  for  H^O  ,  'mass  l8,  a 

-Q 

background  species-"  with  a  partial  pressure  of  ahout  10     atm.  in 
th^  ion  source  region  of  the  mass  spectrometer.    Figure  2  is  a 
reproduction  of  the' oscilloscope  display  after  the  collection  of 
three  passes  had  "been  completed.    One  division  on  th^  horizonteuL 
axis  is  equivalent  to  10  eV;  on  the  vertical  axis,  to  3000  counts 
per_^second.    The  total  number  of  motor  steps  moved  during  the  , 
HgO   data  collection  was  1206  at  intervals  of  6  steps  about  every 

one  second.    The  initial  elecJtron  voltage  control  potentiometer 
setting  was"  10  evl    The  final  setting  was  70  eV.    There  were, 
therefore,'  201  points  displayed  on  the  oscilloscope.    Each  point 
is  the  average  of  three  passes.    Each  pdint  of  each  pass  was 
collected  with  the  scaler  gate  time  equal  to  one  second. 

^  The  reproductions  of  the  oscilloscope  displays  in  Figure_^3 

^    (a),  (b),  and  (c)  show  ionization  efficiency  c\irves  for  Cu^Cl^(g) 

The  collection  was  performed  on  Cu^Cl^Cg)  vaporizing  from  a 

single  crystal  of  CuCl(a)  in  an  dpen  graphite  crucil;>le  which  was 
heated  to  ,215±10^C*  '  At  this  temperature  the  plartial  pressure 
of  Cu^Cl^  in  the  ion  Source  region  wa3  about  lO"     atm.  Figure 

3(a)  is  the  result 'after  one  pa^s,  (b)  is  $he  average, of  the 
results  of  seven  passes,  and  (c)  is  the  average  of  the  resiilts  of 
fifteen  passes.    For  all  parses  the  electron  energy  wets  stepped 
from  7  eV  to '30  eV,  in  intervals  of  0.:^  eV  about  every  0.13 
seconds.    The  gate  on  thfe  scaler  was  open  for  0.1  seconds  for 
each  accumulation.    One  division  on  the  horizontal  axis  of  the 
display  is  equivalent  to  2.5  eV;  on  the  vertical. axis  to  200  ^ 
counts  per  second. 

The  scatter  in  the  CU3CI2    collection  after  15  passes , 

particularly  at  about  26  ey,  was  in  part  due  to  temperature 
fluctuajbions  of  the>  single  X^rysted  during  the  collection  of 
several  passes. 
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DISCUSSION 

The  automatic  acquisition  of  ionization  efficiency  data 
resulted  in  the  following  advantages  pyer  the  previous  collection 
procedure^ 

^  1.    The  data  have  ^impi'oved  precision.    The  data  collected  for 
HgO    at  50  eV  and  shown  in  Figure  2,  for  example,  represent  an 

accumulation 'Of  31,500  total  counts,    The^  error  associated  with 
that  number  of  counts  is  equal  to  177  counts,  op  0,6    percent  of 
the  observed  quantity.    Thidv  precision  can  be  compared  with  a  U.3 
percent  error  associated  witl^  the  strip  chart  recorder  measurement 
of  the  VfeE  analog  signal  at  50  eV  for  .the  same  collection. 

2,  The»coJ.lectio'ns  of  data  are  repeat  able,.    Often,  under 
previous  manual "operation,  the  electron  energy  control  shaft  was 
moved  too  much  Or  too  little  and  at  a  variable  rate.   ;For  automatic 
data  acqiiisition  each  digital  ion  intensity  is  associated  with  a 
discrete  and  definite  electron  energy, 

3.  The  data  are  conveniently  stored  on  a  disc  rather  than  on 
bulky  and  cumbersome*"  strip  ch^irt  paper. 

It,    The  data  are  easily  acgessib^e  in  original  formV  the 
data  acquisition  system  for  further  processing  by  the  system,  by 
the  sister  system  in  the  microwave  laboratory,  or  by  the  large 
University  central  coniputing  facility.    Because  of  this  capability, 
the  scientist  may  design  the  collection  of  data  in  new  ways  and 
may  instruct  the  system  to  process  the  original  data  So  that 
'chemically  meaningful  results  are  available  while  the  experiment, 
is^  being  performed.    This  kind-of  interaction  was  previously  not* 
possible  because  the  manuiSl  data  work-up  delayed  chemic^  results 
by  days  or  weeks ,  r  /  . '  * 

<» 

Future  plans  for  the  system  include  the 'collection  of  • 
ionization  efficiency  data  for  TiO^,  TiO,  and  Ti .from  Ti^Q  and 

Ti|^0^  samples  at  l600^  to  1750^^,  as  the  samples  vapcJtize  to 

conguently  vaporizing  Ti^O^,  ■      *  '  \ 

The  data  collection  procedure  will  be  modified  ITb^TJ-royide  ' 
fo^  coinputer-controlled  refocussing  between  passes.    Piane  are 
aJ^o  being  developed  for  the  determination  of  appearance  poten- 
tials from  ionization  efficiency  data  by  numerical  methoda,*. 

T 

Other  data  acquisition  plans  for  the^  Nuclide  loass  spectrom- 
eter   include  mass  sc^s,  ion  intensities  as  function^  of  temper- 
ature, shutter  profiles,  and  pres,sure  calii)ration  measurements, 

7S  ■  '        .  ■ 
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Overview  -  Session  on  Computer  Assignment 

of  Complicated  Spectra.  *  i 

* 

Conference  on  Computers  in     ^  ^ 
Chemical  Education  and  Research 

Northern  Illinois  University,  July,  19-23,  1971 

W.  t'.  Carnall       '  / 
Chemistry  Division 
Argonne  National- I^iboratory 


^  The  use  of  computers  in  simiJilating  or  analyzing 
complex  spectra  has  been  discussed  in  individual 
>papers  -  in  several  previous  conferences  on  computers 
in  science;  however,  this' appears  to  Ije  the  first 
time  that  the  subject  has  been  swingled  out  for 
special  emphasis.    Certainly  the  role  played  by 
computers  in  spectral  analysis  is  unique,  and  one 
obsei*ves  that  there  is  an  ever  Increa^sing  sophis- 
tication' iri  their  use. 

The  papers  ,presented  in  this  session  are  clrawn 
.from  several  different  areas  of  research,  and  serve 
to  .point  out  some  of  the  important  applications  of 
computers  incurve  fitting  and  band 'shape  analysis. 
The  methods  are  of  course  applicable  to  many  ^other 
specific  problems.    For  example,  Y-ray  and  infrared 
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spectroscopy  are  not  formally  represented  although 
there,  is  also  considerable  activity  in  spectrum 
analysis  in  these  fields  as  evidenced  by  the.  current 
literature.    The  analytical  use  of  Y -ray  spectroscopy 
is  an  example  of  an  application  where  the  required 
curve  fitting  involves  an  extensive  library  of 
standard  spectra  with  the  implied  requirement  of  a 
large  amount  of  readily 'accessible  storage.  In 
general,  the  analysis  of  a  spectrvim  is  accomplished 
at  some  tim§    after    it  has  been  recorded,  but  one  of 
the  papers  presented  here  will  discuss  a  real-time 
analysis  application. 

Several  different  types  of  approach  to  spectral 
analysis  may  be  noted  in  the  work  to  be  discussed. 
For  example,  in  some  applications  it  is  possible  to 
calculate  a  spectrvim  based  on  a  theoretical  model" 
for  the  types  of  interaction  known  to  contribute  to 
the  experimentally  observed  spectrum.  Comparison 
with  experiment  may  then  reveal  the  relative  impor- 
tance of  individual  interactions  or  species,  and 
grossly  wrong  structures  may  become  immediately 
obvious.    The  following  paper  on  NMR  spectra  falls 
in  this  category.    Other  observed  spectra  .stem  from 
•processes  that  are  not  suff icieritly  understood  theo- 
retically  to  permit  simulation  or  perhaps  are  highly 
dependent  upon  the  experimental  condit^ns  under  which 
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they  are  obtained*    In  such  instances  it  may  be  useful, 
to  resolve  the  complex  band  structure  into  a  number  of 
components  based  upon  a  particular  mathematical  function. 
.   This  is  the  case  with  much  optical  spectra,  and  the  use 
.of  a  modified  Lorentzian  ( Lor entzi an  -  Gaussian)  band 
contour,  as  is  discussed  by  one  of  the  present  papers 
is  at  the  same  time ' experimentally  useful  and  con- 
sistent with  the  predictions  pf  a  simple  theoretical 
treatment.    Computer  programs  to  optimize  the  fit  of 
such  functions  to  experimental  curves  are  available 
in  many  computer  libraries     As  noted  in  the  papers 
•  on  ESR  line  shape  fitting,  simulation  is  again  based 
on  the  use  of  Lorentj^g  or  Lorentz -Gaussian  curves. 

In  high  resolution  mass  spectrometry,  the  com- 
puter is  required  to  accurately  deduce  the  center  of  p 
a  line  and  test  the  shape  of  an  envelope  for  unre- 
solved multiplets.    Here  the  data  points  are  both 
accurate  and  form  a  smooth  c\xrve.    A  similar  appli- 
cation exists  in  deducing  the  maximum  energies  of 
various  types  of  nuclear  decay  processes,  but  in 
this  case  the  noise  levels  are  usually  so  high, that 
some  data  averaging  is  required  prior  to  curve  fitting . 

In  all  these  applications  the  interpretation  of 
the  experimental  data  is  now  intimately  rel^tted^  to 
computer  operations.    However,  the  emphasis  on  curve 


73 


2-4  "•  ■ 

fitting  techniques  is  self -limiting.    As  was  pointed 

« 

out  in  a  recent  review  of  absorption  band  shapes  in 
"liquids,  the  ability  to  analyze  certain  types  of 
spectra  has  already  reached  a  degree  of  excellence 
.    that  challenges  the  experimentalist  to  reexamine  the 
physicd  of  the  measurement.    Certainly  this  is  a 
healthy  outcome  of  the  successful  application  of 
computers . 


1  ' 
.  R.P.  Young  and  R.N.  Jones,  Chem  Rev.  71,  219  (1971) 
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A  SCIENTIST'S  VIEW  OP  NMR  SPECTRUM  SIMULATION 

by  —  ' 

Lawrence  C.  Snyder       ^  .  .  ^ 

Bell  Telephone  Laboratories,  Incorporated 
Murray  Hill,  New  Jersey  07974 

Compujter  simulation  is  one  of-  several  tools  to 
answer  questions  and  m^ke  predictions  about  nuclear  magnetic 
resonance,  NMR,  spectra.    My  area  of  interest  and  research  ' 
activity  is  in  the  development  of  NMR  in  liquid  crystal  solvent 
as  a  means  of  precJse  molecular  structure  determination."^ 
Computer  simulations  of  'NMR  spectra  permit  me  to  decide  when 
agreement  of  theoretical  and  experimental  spectra  has  been 
attained;  thus  confirming  an  assumed  molecular  structure,  or 
the  adequacy  of  an  assumed  set  of  magnetic  resonance  inter- 
actions  in  less  familiar  contexts.  '  Computer  simulations  of 
spectra  also  permit  me  to  predict  the  adeauacy  of  NMR  in 
liquid  crystals  for  potential  future  ai^^ilcations  in  structure 
determination.     '     •  *  ^ 

The  principles  of. NMR  are  fairly  well  established, 
but  their  application  requires  considerable  art.    A  .theoretical 
calculatloh  o,f  NMR  spectra  requires  that  tl:ie  system  of  nuclei' 
being  stiidjCed  niust  be  defined       well.' as  the-  Hamlltonian  . 
Which,  specifies  their,  important  interactions  with^each  other, 
the  applied  magnetic-.f ield,  and  other  electromagnetic,  fields 
of  their  envirdnment.    An  important  feature  of  nucleai*  spin 
systems  is  that  the'number  of  states  of  a  system  inicrease^. 


very  rapidly  with  the  number  of  nuclei.'  For  a  system  of  N 

protons  each  having. two  states,  the  total  number  of  states 

N  ' 
Is  2  .    Thus  for  systems  of  over  twelve  protons,  matrices 

describing  the  vector  space  of  states  grow  beyond  the  capacity 

of  modern  computers  and  are  extremely  expensive  to  diagonalize. 

The  Hamiltonian  used  to  describe  NMR  of  molecules 

in  liquid  crystal  solvents  is  rather  simple.. 

The  first  term  gives  the  interaction  of  the  nuclei  with  the 
applied  magnetic  field  H,  with  provision  for  diamag||^tic 
shieldings  in  the  parameters  a.    The  interactions  of  nuclei 

.  of  a  molecule  with  each  other  by  way  of  the  molecular  electrons 
are  described  by  the  indirect  spin-spin  coupling  parameters 
J-j_j,  with  which  you  are  familiar  from  NIVlR  in  isotropi-c  phases.  . 
The  direct,  magnetic  dipole-dipole  interaction  paranieters  D^j 

;are  required  only  for  NMR  in  anisotropic  media  such  as  liquid- 
crystal  solvents.    The  direct  magnetic  dipole  interaction  D  . 

*  .  ■  ;  -      '  '  . 

between  nuclei  i  and  J.       a'raolecule  is  inversely  proportional 
'  .        .  '>         .,  ■  ■  . ■  ^ 

ta-the  cube  of  the" distance' between  the. nuclei.    Its  evaluation 

,  by  speQtrum  analysis.t  makes  possible  the  de-t^rmination  of  ,- 

■precise  molecular  geonjetry..,  It  is  usually. tjhe  ma^Jor  interaction 


which  provides  the  structure  of  NMR  spectra  taken  in  liquid 
crystal  solvents. 

The* main  application  of  computer  simulated  NMR 
spectra  is  facilitate  comparisons  of  theoretical  and  experi- 
mental spectra.    Theoretical  spectra  ca:n  be  preHfiled  as 
tables  of  line  positions  and  ^intensities.    However ^  as  the 
number  of  nuclei  and  their  states  increases  the  length  of  , 
such  tables  increases  rapidly ^and  the  meaning  of  the  entries 
Is  obscured.    Computer  simulation  of  the  spectra  is  made  by 
preparing  a  picture  ofi^he  spectrum. which  results  from 
summing  these  many  lines  with  prope3^  relative  intensity  and 
an  assumed  individual  line  shape  (Lorentziaji) .    The  simulation 
immediately  makes  it  possible  to  distingi^sh  the  overall 
structure  of  the  sp6cftrum  from  the  maze  of  individual  line 
components;  in  other  words  the  f oldest  from  the  trees. 

When  one  is  dealing  with  new  and  unfamiliar  systems, 
comparison  of  these  computer  simulations  w-ith  experimental 
spectra  makes  it  possible  to  decide  whether  the  basic 
Hamfltonian  chosen  is  adequate  to  describe  the^ system.  This 
use  is  illustrated  in  P*igu^e#l  which  shows  €arly  simulations 

of/6He  proton  NMR  of  benzene  in  a  liquid  crystal  solvent. 

♦\  ^  * 

The)  upper-  computer  simulation  spectrum --vwfs  made  using  only 
djfiSect  dipolar  interactions  corresponding  to  a  rigid  planar 
hexagonal  geometry  for  bertzene.    The  central  spectrum  is 


experimental.    The  .lower  simulated  spectrum  was  computed 
using  both  the  direct  and  Indirect  couplings  'nuclei^ 
Dhese  simulations  showed  that  the  main  structure  of^  NMR 
spectra  of  benaene;  in  liquid  crystal  solvents- is  due  to  the 
direct  dipolar  couplings  D^j  of  'the  Hamiltonian. 

When  the.paroper  Hamiltonian  is  well  -established 

<^  «  «  , 

and  one  is  dealing  with  familiar  systems, •  computer  simula- 
tions permit  one  to  deduce  structural  information  from  NMR 
In  anisotropic  solvents.    An  idea  of  the  amount  of  information 
available  was  given  by  the  computer  simulations  shown  in 
Figure  2  for  benzene  containing single  carbon-13  nucleus. 

o  ^ 

The  C-H  bond  lengths  assumed  differ  by  only  0.005  A  but 
produce  clear  visual  differences  In  the  computer  simulated 
spectra.    Similarly,  it  was  easily  shown  that  the  NMR  in 
liquid  crysl4l  would  clearly  distinguish  benzene  from  possible 
isomers  having  a  three -fold  or "greater  axis  of  symmetry. 

The  use  of  computer  simulation  in  an  interactive, 
mode  at  a  Graphic-1  conso],e/proved  to  be  a  convenient  way 
to  determine  a,.small/number  of  structural' parameters  when 
the  NMR  spectra  are  relatively 'simple  as  for  cyclopropane. 
However,  when  the^number  of  pa3^ameta(i?^|^hich  must  be  varied 
Is  greater,  the  irtfeeracftive  mode  of  analysis  is  less 
satisfactory,  part|cular*ly  when  the  number  of  nuclei"  is 
larger  and  the  NMR  spectra  correspondingly  more  complex  as 
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is  illustrated-  by  bicyclobutane .    Small  c^ahges'of  assumed 
geometry  for  bicyclobutane  caused  the. lines 'of  this  complicated 
spectrum  to  move  through  one  another  in  ways  which  defied 
Identification  and  analyses.    The  Graphic  console  was  an 
inadequate  tool  and  analysis  was  completed  by  provision  .of 
a  different  eij^peri^hent,  double  resonance.   .The  computer" 
simulation  in  Figure  3  of  the  proton  NMR  of  bycycl-obutane 
corresponding  to  that  analysis  showed  definitively  its  # 
validity.  - 

_       The  problems  of  co|iputlng  and  analyzing  MR  spectra 

^of  molecules  having  large  ntmiber  of  nuclei  are  met  in  cyclo-- 
hexane,    which  has  twelve  protons.'   The^  Nte  spectruftr  in  a 
liquid  crystal  is  symmetrise  and consists  of  "t'wo -^aussian 
shaped  collections  of.  resonance  lines,    ^'or  jiyclohexane,^ 
the  rrjatrices  involved  are  of  dimension  iOOOxlOOO  and  too  ' 
largfe  for  convenient  and  economical  diagonali.zation .  Even 

_after  a  costly  computer  simulations/  if  possible,  one  could 
not  extract  the  interesting  direct  dipolar  interactions  by 
fitting  these  rather  amorphous  masses  of  resoiiance.  Again 
the  Judicious  way  to  proceed  has  turned  out  to  be  an  improve- 
ment, of  the  experiments  partial  deuteration  of  the  cyclo- 
ft 

hexane  with  decoupling  of  the  deuterons  by  double  resonance. 
Here -again,  computer  simulations,  are  providing  evide/ice 
on  the  adequacy  of  our  understanding  of  double  i^esonance  .in  • 
,this  new  context  of  aiQisotropic  solvents. 
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In  summary,  computer  simulation  is  another  tool  to 
answer  interesting  questions  in  magnetic  resonance.    It  is 
taking  its  place  among  the  alternative  experimental  and 
theoretical  tools.  ^ 


1.  L.  C.  Snyder,  J.  Chem.  Phys.,  4^,  404l  (I965). 

2.  S.  Meib6om^d  L.  C.  Snyder,  Acct'S.  of  Chem.  Res.,  4 
81  (1971) .        .  '  • 
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hiJ""'^^^  ANALYSIS  FOR  DETERMINATION  OF 
UNRESOLVED  METAL  HYPERFINE  SPLITTINGS  IN  ION 
RADICAL*^^^  APP.LICATION  TO  THE  BENZENE  ANION 

M.  T.  Jones,  M.  Komarynsky ,  and  R.  D.  Ratalczak^ 

Contribution  from  the  D'epartment  of  Chemistry 
University  of  Missourl-St.  Louis 
St.  Louis,  Missouri  63121 


'  ABSTRACT 


A  llneshape  analysis  Is  presented  which  allows 
the  extension  of  the  study  of  Ion-pairing  Into  the 
spectral  region  where  the  metal  hyperflne  splitting  ^ 
(hfs)  can  no  longer  be  resolved.    The  analysis  allows 
one  to  quantitatively  evaluate  the  metal  hfs  and  the 
component  llnewldths  from  experimentally  measured 
llneshapes  and  llnewldths.    The  utility  of  the 
analysis  procedure  Is  demonstrated  by  Its  application 
to  the  benzene  anion  radical.    The  unresolved  metal 
hfs  In  the  benzene  anion  radical  ranges  from  -75 
mllllgauss  at  -VZO'C  to  +150  m1111gau*ss  at  -30*C. 
The  component  Ifnewldths  are  found*  to  range  from  — 20Q 
mllllgauss  at  -120**C  to  -650  mllllgauss  at  -^aO^C. 
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INTRODUCTION 

There  hts  been  considerable  Interest  In  the  use 
Of  ESR  techniques,  to  study  Ion-pairing  effects.^  To 
date,  It'has  been  necessary  to  explicitly  resolve  the 
metal  hyperflne  splitting  (hfs)  In  order  to  study  such 
effects.  .Although,  very  recently  certain  changes  In, 
g-values  have  also  been  associated  with  1on-pa1r1ng. 
A  llneshape  analysis  Is  presented  which  wIlKallow  the 
extension  of  the  study  of  Ion-pairing  Into  the 
spectral  region  where  the  metal  hfs  can  no  longer  be  • 
resolved.    The  analysis, al lows  one  to  quantitatively 
.evaluate  the  metal  hfs  and  the  component  linewidth 
(hence  to  obtain  the  "true"  T,)  from  epcperimentally. 
measured  llneshapes  arid  .1  iiiewtdths.    The  utility, of  the 
analyses  procedure  wi.ll  be  demonstrated  by  Its  ap'plica- 
tt^n.to  the  benzene  anion  radical. 4  , 

The  underlying  basis  of  the  analyst^  jirocedure  is 
;jthe  fact  th^t  £SR  spectra^l  llneshapes  are  Strongly'^ 
dependent  .upon  the  metal' hfs  even  when  th'eWrS'di Vidual  ' 
(?ompdnents  ar^  not  Resolved.    The  analys1s«rcons1sts 
af  detennlhing  precisely  what  the  llneshape  looks  like 
as  the  metal  hfs  increases  from  zero  to  the  point 
whiere  the  Individual  comp,on^nts  Just  begin  to  become 
resolsred.  '  For  the  .calculations  described  .here,  1t  is 
assunfed  that  the  3-1  fe  time  of  the  cation-anion  pair  is 
long,  coriipafed  to  1;he  inverse  of  the  metal  hfs. 

"  EXPERIMENTAL  AND  RESULTS 

The  spectral  line  shapes  were' simulated  for  an 
ESR  line  width,  cohs'ists  of.  n^closely  spaced  individual 
component  lines.   ..Ror  example,  in  the  case  of  ion-pair 
for-mation.  with  a  single  sodigm  ion,  four  equally 
inten$«  component  llnes'.are'used.    The  simulatibn 
calculations  were  performed  in-such  a  manner  that 
errors  d^e  to  t'runcaitlon^. of -thtf^ individual  component 
,.lines  fell  otits;;!de  of  thfe  Ve.gloh  V  interest.  The 
'density  of  poinlts  calcula.tgd  was  such  that  the  per- 
centiCge  of  .e^rror  a'ssdciated  with  picj^ing  the  correct 
half-width' it  extreme,  slope  ranged  from  +l%'when 
(Hfs/AH^,)«.0.,to  +0f'25^i.«lie.rr  ("ft^s/AHp )-t .0~and  +0.5«  /, 
when  tlHTs/AHj.)-!^' to  +0.35;  wfren  (=ttfk/AHp)-l  .6  7or 
Lorentzian  ffjid ^GaussTarj  cpnjponent  llneshapes,  respec- 
tively.   The  component  linewidth  is  given  by  aH-. 
figure  ;I.  shows  norma'lHzed.  plots  of  half  of  the  ^ 


derivative  of  the  resonance  llnewhape  for  various 
ratios  of  hfs  to  AH-. starting  with  four  equally  Intense 
Lorentzlan  shaped  components.    The  data  In  Figure  I 
show  that  as  the  hfs  increases  from  zero  relative  to 
AHp  the  llneshape  becomes  less  and  le.ss  Lorentzlan,'  It 
fails  off  In  the  wings  much  more  rapidly 'than. a  Loreht- 
2lan.    For  hfsrAH-  ratios  >0.6 -the.  11n«shape  falls  off 
even  more  rapidly  than  a  single- Gaussian  component* 
Similar  behavior  Is  observed  when  Gaussian  components 
are  used  except  that  the4r  1  Imlting  1 1nes.hape  Is  a 
Gaussian  and  they  fall  off  more  rapidly  than  a  Gaussian 
In  the  wings  as  the  ratio  df-  hf^:AH^'1s  Increased. 

Figure  U  shows  a'  pV&t  of  the  ratio  of  Ihe  calcu- 
lated llnewtdth-  (AH^v  )  td  the  Individual  component 
line. width  (A)lp)  vefSds  the  ratio  of  Hfs:AH^.    One - 
notes  that  as  the  ratio. of  HfstAHp  Is  Increlsed  so  too 
tdoes 'AHq.   .    A  similar  result  Is  observed  when  Gaussian 
componeHts,  are  usfd.    the  results  shown  In  Figures  I 
and  'II  are  changed  by  an  insignificant  amount  If  one 
choses' 'Instead  of  four  equally  intense  lines,  two  sets 
of  four  equally  Intense  lines  with  relative  Intensity 
and  felatlve  hfs  appropriate  to  the.. case  of  ton-pair - 
formation  with  potassium  Ions  Instead  of  sodium. 

The  question  immediately  arises:;.  Are  there  J 
examples  of  the  behavior  of  llnesliapeSv as  described 
a^pve?    The  answer,  of  course,  1s  yes. _ 'Pa«t -stuiiles  on 
very  dilute. water  spVutlons  (9.5  x  lO-^  Mj'o'f  pei'oxyl- 
amlne  di sulfonate  "anion  radica]  h>ve  shown  that  the 
llneslfape  Is  npn-Lorentziati  with  a,  more  rapid  fall  off 
1n  t|ie  wings:*'.   Figure  4  of  reference. 8  shows  .a  normal- 
izecT'plot  of  the  derivative  of  the  resonance  absor*>t,lbn 
-curve  of  peroxylamine  disulfonate-anlon  radica?*  In'  ' 
comparison  w.lth  pure  Lorentzfah  >»nd''6aussi'an  Tin^'    '  . 
shapes.^      •  •  ?  .    '        •  ' 

Careful,  studies  of  .the  lines'^ape  of  the  b'eniene. 
anion  radical  In  solution  show  thit,  a)  it  is-  depend- 
ent upon;  the  benzene  concentratiVn-,  b)'  it...is  tempera- 
turedependent,  and  c)  It  falls  off-moFfr  rapdily  in*  the 
wings  than  do6%  a  Lorentzian  linevO»'»'"    This  behavior 
is,  demon'strated  In  Figure\lII  for  a  soWtton  of  the 
benzen?  anion  radical  in  a  '2:1  mixture  of  tetrahydro- 
furan  and,  1-,2-dimethoxyethane' which  is  0.3M  in  benzene. 
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Figure  III  shows  another  way  of  looking  at  the 
Information  shown  In  Figure  I.    On  the  ordinate  Is 
plotted  the  "observed"  line  position  versus  that  of  a 
pure  Lorentzlan-  line  for  fixed  percentages  of  the 
height.    The  analysis  procedure  consists  of  marking  off  \ 
the. baseline  In  units  of'one-half  the  peak-to-peak 
width.    The  height  of  the  curve  Is  marked  -off  In  per- 
centage of  the  maximum  height  (which  occurs  at  +1/2  the  - 
peak-to-peak  width)  from  the  base  line.    The  dUtance 
(normalized  with  respect  to  one-half  the  peak-to-peak 
width)  from  the  center  of  the  resonance  Tine  to  the 
decaying  wing  at  a  given  percentage  of  the  height  Is 
measured  and  plotted  as  shown  In  Figure  III  against  the 
corresponding  value  for  i  pure  Lorentzlan  line.    A  pure 
Lorentzlan  llneshape  yields  points  which  fall  along  the 
dashed  1 Ine.  ^  ^  , 

Whereas  the  data  plotted  In  Figure  i;T  are"exper1- 
ment9l'',  the  data  shown  In  Figures  IV  and  V  are.  obtained 
from  calculations  of  the  expected  llneshape  of  the  central 
11n,e  In  the  benzene  anion  radical  excluding  and  including 
the  effect  of  the  hfs  arising  from  the  natural  abundance 
of  13c,  respectively,''     Note  that  In  F1<|uVe  IV  there  are 
a  series  of  straight"  lines  coming  out  of  , the  origin.  The 
larger  the  ratio  metal  Hfs:AHj.  the  lower  the  slope.  The 
Inclusion  of  the  effect  of  the  '^c  hfs  leaves  a  portion 
of  the  data  In  Figure  IV  unchanged,  namely^that  portion- 
along  the  abscissa  from  1  to  approximately  3'.  For 
distances  greater  than  3  halfw1dth°Si  the  Individual 
curves  are  pulled  do^n  toward  the  abscissa  as  shown  In 
Figure  V.    Note  the  5ood  agreement  between  the  data  shown 
In  Figure  Ill  and  that  In  Figure. V. 

In  Ffgure  VI  .Is  shown  a  plat- of  the  data  obtained 
from  our  calculations  which  allows  a  rather  easy  com- 
parison ,  of  the  calculatlans  with  experimental  res-ults. 
L1n%.  pos'ltlons  In  units  of  h?flf-w1dths  are  plotted  ,  v 
versus  the  ratio  Hfsr^Hj.  for  a  series  o>  family  of* 
curves  taken  at  different  relative  heights.    Our  calcu- 
lations show  tha^t  the  positions' associated  the  pointy 
a;t  SOX,  60%,  and  4.0%  of  the  maximum"  height  arje  not 
significantly"  perturbed  by  the  presence  of  the  '3c' 
•hfs. '3  , 'Thus;  we  may, safely  compare  "experimental" 
positions  at.  80%,,  6QX  and  40%  of  the  maximum,  height  with 
the  curves  shown  in  Figure  VI.      One  obtains  In 

/  ^ 
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this  way  the  "best"  fit  of  the  experimental  dati  and 
the  value  of  the  ratio  HfsrAH,,  at  which  it  occut's. 
Using  Figure  II,  one  czn  convert  this  ratio  into  a 
number  which  gives  the  ratio  of  the  "observed"-  line- 
width  to  the  component  linewidth.    These  two  ratios 
plus  the  experimentally  measured  linewidth  yield  the 
iriformati.on  we  are  seeking,  namely,  the  metal  ,hfs  and 
t^e  component  linewidth. 

F.igure  VII  shows  the  results  of  such  an  analysis 
for  the  metal  hfs  for  a. large  range  of  benzene  anion 
radii:al  concentrations  and  temperatures.    Of  course, 
the  analysis  is- not  capable  of  determinin-g  the  sign  of 
the  hfs,  although  we  have  plotted  the  data  through  the 
origin.    There  is  no  incontrovertible  proof  that  the 
metal,  hfs  doesr  in  fact  change  sign.    However,  one  can 
argue  in  analogy  with  a  number  or  othei*  ion-pairtd 
radicals  where  the  hfs  values  are  known  to  go  through 
2ero.'^a.'^c,Zf  ,  It  is  significant  that  the, metal  hfs 
values  a^re  relatively  independent  of  the  benzene  con- 
centration and  hence  the  benzene  anion  radical  con- 
centration.' .This  fact  taken  together  with  the  fact 
that  the  jiifethod .  of  analysis  appears  to  work  over  such 
a  large  range  of  benzene  and  benzene' anion  concentra- 
tions suggests  that  the  assumption,  that  the  life-time 
^the  benzene  anion  radical -metal  ion  pair  is  long 
compared  to  inverse  of  the  .metal-  hfs,  is  justified. 14 

Figure  VIII  shows  a  plot  af  the  component  line- 
width  versus  temperature  for  the'same  range  of  con-  . 
dentration  as  shown  in  Figure  VII.    The  linewidths  ai^e 
dependent  upon' concentration,  although  the  scatter  is 
larger  than  one  might  like  to  see.    In  a.dd.ition'^  the 
temperature  dependence  ..appears  to  level  off  at'about 
-80*C/   This  is  t^o  be  contrasted  with  the  i'nh.tfmogene- 
ousty  broadened  line  which  displays  «  minimum  at  ' 
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that  it  is  temperature  dependent  and  that  it 
increases"  with  increasing  temperature.    This  latter       ^  v 
observation  agrees  with  that  of  Danner  and  Ma'kil^  / 

.  for  the  hexakis  (trifluor^methyl )  benzene  anion  / 
radical.    Our  values  are  >S^p  gauss^t  -100**C,  2.79  ^^ 
ga.uss  at  -80"C,  ^and  2.85  gauSs  at  -62"C. 
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13.  Our  calculations  show  that  for  ^^C  HfsrAHp  ratios  • 
greater  than  5  and  metal  HfsrAH-  ratios  less  than 
approximately  0.36  (as  the  former  becomes  larger  so 

does  the  latter  ratio)  that  the  presence  of  the  ^^C  ^ 
Hfs  can  be  ignored  in  the  analysis  procedure.  Very 
few  spectra  have  been  observed  to  date  which  do  not 
fall  into  the  above  categ-'ory.  ^  > 

14.  This  observa^tion  is  consistent  with' the  low  elec*  .  - 
tron  transfer  rate  .between  the  behrene  and  the 

benzene  anion  radica.1  observed  by  Me^linosky  and  f  / 

Bruning  (Ref.  5)  and  suggests  that  the  electron 

transfer  takes  place  with  the  simultaneous  transfer 

of  the  metal  ion  (see  Ref.  10b  for  fan  extended  ^' 

discussion  of  this  point).    We  plari  further  comput- 
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er  simulations  which  will  Include  the  effects  of 
metal  Ion  exchange  ajid  electron  transfer.  However, 
we  are.  it  present,  quite  Impressed  with  the 
success  we  have  had  to  date  by  neglecting  such 
effects  and  the  lack  of>v1dence  that  they  need  to 
be  Included  for  adequate  spectra  simulation. 
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/  CAPTIONS  FOR  FIGURES 

FIGURE  I,    Normalized  plot  of  ^alf  of  the  derivative 
of  the. resonance  lineShape  for  various  ratios'  of  hfs 
to  AHq.    The  individual  components  consisted  of  four 
equally  intense  Lorentzian- shaped  lines.    The  values 
of  the  ratio  hfsrAH^  shown  are  0,  0*17,  0.34,  0.52,  ' 
and  0.69  in  order  or  decreasing  amplitude  in  the 
wings.  •  The  dotted  line  represents  a  pure  Gaussian 
lineshape. 

FIGURE  II.    Plot  of  the  ratio  of  the  calculated  line- 
width  (AH^a   )  to  the  individual  compan^t  line  width 
(AHp)  versos  the  ratio  of  hfs:AHp.    The  individual 
components  consisted  of  four  equally  intense  Lorent- 
zian  shaped  lines.    When  the  ratio  hfs:AHj>^  0.86  •the 
individual  component  lines  Just  begin  to  be  resolved. 

FIGURE  III >    Lineshape  of  the  Mu"0  component  in  the 
first  devlative  ESR  spectrum  of  the  benzene  anion 
radical  in  7:1  (THF-DME)  solvent  (0.3M  benzene)  at 
different  temperatures.    The  dashed  line  traces  the 
locus  oV  the  Lorentzian. 1 ine.    ^ 

FIGURE  IV.*    Plot  of  the  *^observed"  or  "calculated'' 
resonance  line  position  versus  the  "theoretical" 


Lorentzian  position  for  a  series  of  increasing  values^ 
of  the  metal  HfsrflHr  ratio.    The  effect  of  '^c 
is  not  included 


3? 


/ 

/ 


Hfs 

FIGURE  V.    -Plot  of  the  "observed"  or  "calculated" 

resonance>  1  ine  position  Versus  the  "theoretical"  f.  • 

Lorentzian.  po's\i tion  for  a  series  of  increasing  values 

of  the  metal  Hfs:AHp  ratio.    The  '^C  Hfs:AH-  ratio  for 

the  calculations  shown  here  is  8.   

FIGURE  VI »    Plot  oj  line  positions  in  units  of  peak- 
to-peak  half  Width  versus  the  metal  Hfs:AHp  ratio  for 
various  different. relative  heights.    The  effect  of 
13c  Hfs  is  not  incl'uded  in  this  plot.  / 

FIGURE  VII.    Plot  of  the  metal  hfs  versus  temperature  ' 
for  a  series  of  different  benzene  concentrations. 

FIGURE  VIII.    PTot  of  component  linewidth  (AHp)  versus         '  ■ 
temperature  for  a  series  of  differeri,t  benzene  con- 
centrations. 
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Evaluation  of  Complex  E.S.R.  Spectra  by  Computer 

•  %  * 

William  E.  Bennett,  Robert  E.  Buckles 

Department  of  Chemistry  . 
;  University  of  Iowa 

Iowa  City,  Iowa 


Introduction 

The  interpretation  of  complex  E.S.R,  spectra  can 
prove  difficult,  even  with  the  computer,  when  the„ number 
of  theoretical  lines  exceeds  a  few  hundred.  The 
primary  difficulty  is  the  fact  that  in^such  cases- -there 
is  considerable  accidjerttal  overlap  of  lines.    The  authors 
have  found  that  one  can  resolve  such  spectra  quite  easily 
by  using  simplifications  which  considerably  speed  the 
cbmput^ation.     If  the  computation  is  sufficiently  fast,  one 
may  try  hundreds  or  thousands  of  splitting  constants  in  a 
reasonable  length  of  time.    This  paper  is  a  report  of  the 
results  of  this  approach. 


•Method 

■    '  Vhe  simplest  description  of  an  E.S.R.  line' is  by  its 
height  and  by  its  Jfield , position  relative  to  the  center 
liite.  , 
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If  we  have  an  exact  accidental  ovprl'ap  of  two  lines,  .the 
^resulting  peak  will  have  a  height  vfhich  is  the  sUm  of  the 
two.    This  is  true  if  both  lines  have  the  same  width, 
Mich  is  generally  the  case  for ' organic  radicals.     If  the 
overlap  is  net  e^jact,  this*  is  not  ^true.    The  ajuthors  haver 
found;  however^  that  one  can  rapidly  eliminate  all  but  a 
few  of  the  possible  combinations  of  splitting  constants 
by  assuming'  that  overlapping  lines  sum  their '  intensities . 
Another  approximation  is  that  neighboring  peaks  do  not 
affect  each  other. 

The  experimental'  line  intensities  and  field 
positions  are  introduced ^as  data.     Guesses  of  splitting 
constants  or  of  incrementing  information  are  then 
supplied.  '  '  ^ 

•  « -f 

The  pro^gram  computes  the  theoretical  line  spectrum, 

summing  lines  which  €all  under  measured  peaks.    'If  no 

computed  •  .fines'  falL  near  ap  experimental  peak^  the 

splitting  constants  'are  rejected.    A  standard,  deviation 

for  the  fit  between  the  theoretical  and  experimental      .  ' 

spectrum^is  computed.     This  is' given  in  the  units  of  the  \ 

experimental  spectrum  (usually  cm  or  mm.).     If  the 

standard  deviation  is  less  than  a  value  supplied  with  the 

data,  a  print-out  of  the  experimental. and  computed  spectral 

line  intensities  is  given. 

This  first  phase  of  the  computation  narrows  the 
number  of  possible  combination  of  splitting  constants  from 
several  hundred  to  a. few.     *  ^  ^ 

The  second  phase  of  the  calculation  is  to  introduce, 
the  splitting  constants,  which *were  selected  by  phase  one, 
as.  data,  along  with' an  estimate  of  the  line  width.  Thisf 
phase^ computes ^a  theoretical  spectrum  assuming  a 
Lorenzian  line  shapes  '  Time  is  saved  in  that  the  intensity 

.need  be- computed  only  at-^the  peak  and  valley  of  tKe    *  . 

,  theoretical  line,  which  may  be  a  composite  of  many  lines'. 
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To  donserve  time,  the  line  shape  function  is  in  the  form 
of  a  table. 

A  standard  deviation  J-^areJji^ompu  This  second 

step  usually  narrows  the/sftile^ ion  ^of^^s^Dlitting  constants 
to  one  or  two  possibilitv£5^  | 

A  final  third  phase  is  to  refine  the  line  width  to 
give^  the  lowest  standard  deviation.     The  be^t  splitting 
constants  from  phase  two  are  used.     This  final  phase 
generaTLy  narVows  the  chpixre  to  one  set  of  constants. 
Plotting  the  computed  spectrum  verifies  this  further. 


Results 

^   Tetraanisylethylene  forms  a  radical  cation  which  has. 


1053  theoretical  lines.     The  observed  spectrum  is  given  in 
Figure  1  arid  has  89  lines.     The  regulai:  spacing  of  -the  lines 
indicates  that  the  three  splitting  constants  have  a  common 
factor  or  nearly  so.     The  spacing  is  about'  0.141  gauss. 
The  splitting  constant  can  then  be  found  by  incrementing  by 
the  spacing. 

The  first  phase  oJf  the-  computation  gave,  out  of  many 
hundred  increments,  several  possible  sets  of  splitting 
Constants  with  low  standard  deviations.    The  second  phase 
narrowed  this  to  two.     The  third  phase  gave  one.  The 
coJhputed  spectrum  Is  given  in  Figure  1 . '"'The  splitting 
constants  were:     8H      0.705  gauss,  8H  =  0.^20  gauss  and 
12H  =  0.844  gauss.     The  refined  width  was  0/078  gauss. 
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The  results  were  further  confirmed^  by  fitting  the 
spectrum  obtained  when  the  ethylene  carbons  wpre  enriched 
with  carbon  13. 

An  additional  program  which  the  authors  have  used 
subsequent' I  to  the  above  work,  preprocesses  the  experimental 
data  for  phase  on«.     For  this  program  a  line' width  is 
assximed.  /  An  approximation  is  made  of  the  intensities  the 
experimental  lines  would  have  with  very  narrow  width.  .This 
is,  in  effect,  a  resolution  enhancement  and  gives  a  more 
exact  fit  in  phase  one.    This  conditioned  spectrum  is  not 
used  in  phase  two  or  three. 
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Structure  Detertalnatloti  of  Organic 
Molecules  by  Mass'^Soectrometry 


K,  Blemajin 
Department  of  Chemja^xry 
Massachusetts  Institute  of  Technology 
Cambridge,  Ma,    ,  02139 


the  vast  amount  of  data  which  a  modern  mass  spectrometer  can 
prjjduce  within  a  short  time  has,  over  the  past  few  years,  forced 
th0  chemist  to  rely  more  and  more  on  computers  to  ensure  the  full 
and  efficient  exploitation  of  these  data.    The  techniques  fall  into 
three  general  categories:    The  high  speed  by  which  mass  spectra  are 
recorded  in  certain  situations  requtres  on-line  data  acquisition; 
the  large  volume  of  raw  dat^  (mainly  time-intensity  pairs)  reprints 
fast  and  efficient  processing  (to  mass-intensity  pairs,  and,  in  the 
c^se  6f  high  resolution  data,  further  to  elemental  composition- 
abundance  pairs.);  and  finally  interpretative  methods  to  identify 
the  compound  -^ving  rise  to  the  spectrum  or  to  aid  the  chemist  in 
thi9  interpretation. 


Historically,  the  first  instance  of  at  least  semi-automatic 
data  acquisition  and  full  scdle  processing  of  the  resulting  data 
on  a, large  computer  was  triggered-  by  the  desire  to  fully  exploit 
the  output  of  a  high  resolution  mass  spectrometer  used  in  the 
determination  of  structures  of  complex  organic  molecules (1)*. 


It  should  be  noted  that  computers  had  been  u8,ed  previously 
in  the,  computatiojis  required  to»  perform  routine  quantitative 
analyses  on  complex  mixtures,  in  which  the  input  data  were 
recorded  in^ct^nventional  form  and  manually  transferred  onto 
punched  c^rds  or  paper  tape. 
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The  accurate  measurement  of  the  position  of  a  few  hundred  lines 
.  representing  a  spectrum  recorded  on  a  photographic  plate  using  a 
Mattauch-Herzog  type  double  focusing  mass  spectrometer,  by  far 
exceeded  the  patience  of  a  densitometer  operator  and* led  to  the 
design  of  a  8emi**automated  itfensi tome ter .with  punch-card  output. 
Further  evolution  went  through  the  stages  of  digital  tape  ttecording 
(2)  and  finally  on-line  operation  of  the  densitometer  (3,  4\). 
Similar  techniques  were  developed  for  fast  scanning  spectrometers 
of  the  Nier-Johnson  Geometry,  which  do  not  posses?  a  focal  plime. 
These  instruments  thus  can  not  make  use  of. a  photographic  plate^^d 
must  recprd  on-line  into  the  computer  (5,  6).    The  primary  data  . 
obtained  in  either  case  is  a  parameter  related  to  the  mass  of  a 
particular  ion  giving  rise  to  the  signal,  as  well  as  its  abundance* 
The  conversion  of  this  parameter  to  mass  and  further  to  the  elemental 
composition  (the  interpretative  important  information)  of  the  ion 
will  be  discussed  later. 

A  second  area^ of  mass  spectrometry  in  which  the  data  rate 
requires  sophisti-cated  data  aquisition  techniques  involves  the 
operation  of  the  spectrometer  in  conjunction  with  a  gas  chrolaatograph. 
In  order  to  accumulate  all  the  data  necessary  to  identify  all  th^  , 
components  bf  a  complex  mixture,  a  complete  mass  spectrutt  has  to  be 
recorded  every  few  seconds  during  the  entire  gas  chromatogtam,  whit^ir 
may  last  1/2  -  2  hours. Theif  accumulation,  storage  and  utilization 
In  a  relatively  short  time  la  practically  possible  only  vlth  the 
aid  of  a  fast  computer  having  a  reasonable  atorage  capacity  and 
certain  sophisticated  (at  least  as  far  as  chemical  laboratories  go) 
output  devices.    Fortunately,  the  nature  of  the  compounds  and  the 
information  content  of  a  series  of  spectra  op  a  mixture  oft  compounds 
separated  by  gds  chromatography,  rather  than  a  single  substance, 
maKe  it  in  most  cases  unnecessary  to  determine  the  elemental 
composition  of  all, the  ions  and  one  can  get  by  with  their  nominal 
mass,  i.e.  the  dat^' provided  by  a  conventional,  single  focusing 
mass  spectrometer  (7,  8).  .  ^ 

As  alluded  to  above,  the  on-line  recording  of  a  mass  spectrum 
involves  sampling  the  outnut  of  t^e  electron  multiplier  of  the 
mass  spectrometer  (or  the  photomultiplier  of  the  densitometer) 
during  the  scan  at  a  rate  commensurate  with  the  accuracy  required 
for  the-mass  measurement.    In  high  resolution  mass  spectrometry 
one  wishes  to  determine  the  mass  with  an  accuracy  in  the  order  of 
one  part  per  million,  which  requires  high  sampling  frequency 
<10  -  50  KHz)  and  high  precision  with  respect  to  the  x-axis 
parameter  (time  in  the  case  of  on-line  scanning  of  the  spectrometer, 
or  distance  when  using  a  photografjhic  plate).    Since  the  peak  profiles 
are  guassian,  their  center  can  be  calculated  easily  by  the  usual 
methods,  such  as  interpolation  of  the  data  points  around  the  maximum, 
or  the  centroid.    The  degree  of  sophistication  with  regard  to 
thresholding  and  the  detection  of  incompletely  resolved  multiplets 
(due  to  different  ions  falling  at  the 'same  nominal  mass)  differentiates 
simple  approaches  useful  for  routine  spectra  f roj^  more  complex  ones 
produced  from  minute  amounts  of  material  of  complex  chemical  structure, 


\ 


2-38 


in  which  high  accuracy  in  the  measurement  of  the  mass  of  ions  of  low 
abundance  becomes  of  gre^t  importance. 

An  example  of  the  type  of  data  encountered  in  high  resolution 
mass  spectrometry  and  the  relationship  of  the  various  parameters  Is 
illustrated  in  Table  I, 


Table  I 


Position  of  line 
center  (i^  mass) 
measured 


Mass  calculated 
(in. mass  units) 


Corresponding 
elemental  composition 


173,4863 


484.2721 


Sl"36^2^3 


The  relationship  between  line  position  and  mass  i^  based  oti  the 
measurement  of  two  lines  due  to  ions  of  known  composition  (from  an 
Internal  standard)  and  lower  and  higher  mass  than  the  unknown.  The 
elemental  composition  is  cpmputed  directly  from  the  mass  found  to 
correspond  to  the  line  position  measurement.    It  is  that  combination 
of  elements  whose  sum  of  exact  atomic  weights  equals, the  empirically 
found  mass*    The  latter  has  to  be  determined  as  accurately  as 
possible,  to  assure  unambigous  assignment,  because  the  sym  of  other  *  • 
combinations  may  be  very  .close  (e*g*  the  mass  of  C^^H*»NaO^  is 
484.27258  whil^  th^it  of  C^^U^^U^O^  is  484,27123,  a"^"^  Jo,^^ 
difference  of  only  0.0013  mass  units).    The  same  relationship 
holds  for  the  relationship  of  the  time  at  which  a  pe^jK  maximum  has  to 
be  measured  in  the*  case  of  recording  on-line  with  a  ^^anhing  mass 
spectrometer.    The  time*  differences  between  the  peak  ^renter  of 
the  reference  ions  and  the  unknown  ion  has  to  be  determined  with 
an  accuracy  of  a  few  microseconds, 

A  molecule  consisting  of  so  many  atoms  fragments  upon  electron 
impact  in  the  ion  source  of  the  mass  spectrometer  into  a  large  variety 
of  fragment  ions  varying  greatly  in  abundance*    While  the  more 
abundant  ones  are  usually  also  of  structural  significance,  some  ions 
of  low  abundance  may  also  be  of  great  importance.    Thus,  the  mass  of 
all  detectable  ions  has  to  be  measured  accurately  and  th^ir  elemental 
composition  has  to  be  calculated..    Considering  the  large  number  of  . 
combinations  of  atoms  which  have  to  be  tested  for  each  accurate  masrs, 
a  fast  computet?  is  the  only  means  by  which. this  can  be  accomplished 
routinely.    Some  of  the  problems  encountered  in  this  work  dnd  their, 
solutions  have  been  summarized  ^previously  (9). 


High  resolution  mass  spectrometry  has  been  of  great  value  in 
the  determination  of  the  structure  of  certain  complex  organic  molecules 
available  only  in  extremely  small  quantities.    An  example  is  the 
recent  eluoidatlon  of  the  structu^ie  of  a  nucleoside  representing  a 
minor  constituent  of  phenylalanine  transf er-ribonucleic  acid.    The  u 
mass  of  the  ion  corresponding  to  the  intact  molecule  revealed  its 
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composition  to  be  ^16^20^6^5*    ^  iMijor  fragment  Ion  had  the  composition 
C.^II.qN«0  which  appeared  to  be  aromatic  (few  hydrogens)  and  indicated 
tne  presence  of  a  substituent  that  contains  the  remaining  atoms.  The 
arrangement  of  the  heteroatoms  in  this  substituent  could  be  deduced 
from  catain  ions  of  relatively  low  abundance  corresponding  to  th,e 
loss  of  some  of  the  functional  grbups  from  the  molecule.    These  data, 
in  conjunction  with  the  UV,  IR  and^ NMR  spectrum  of  the  substance,  of 
which  only  a  fraction  of  a  milligram  was  available  at  any  one  time, 
as  well  as  synthetic  models  and  biogenetic  considerations.,  led  to 
the  proposal  of  structure  I  for  this  biologically  significant  substance 
(10,  ID-      '  ^  '  , 


CO^CH. 
I   2  3 
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Another  important  area  was  alluded  to  earlier,  namely  the  use 
of  a  computer  to  record,  process  and  utilize  the  spectra  obtained 
frofij  a  mass  spectromg^ter  which  is  monitoring  a  gas  chromatograph. 
The  schematic  of  such  a  system  is  shown. in , Fig.  1.    It  Is  designed 
for  continuously  scanning  and  recording  approximately  AOO  spectra 
obtained  from  a  GC  rff laent  (7).    Datk  is  acquired ^during  the  up- 
scan  portion  of  each  .4  sec.  cycle  of  the  magnetic  field.  Peak 
centers  and  intetisity  data  . are  determined  in  real  time  and  the 
partially  reduced  data  is  transferred  to  disk  dilring'the  collapse 
of  the  magnetic  field.    Background  threshold  is  continuously  up- 
dated by  the  software  as  the  data  is  acquired.  '  In  this  Way  the 
loss  of  sensitivity  usually  associated  with  arbitrarily  set 
thresholds  involved  in  hardware  peak  detection  is  avoided  by  more 
fully  using  the  capabilities  of  the  computer. 


2-^0 

Presently  the  system  consists  of  a  Perkin-Elmer  990  gas  chromatography 
an  Hitachi-Perkln-Elmer  RMU-6L  single  focusing  mass  spectrometer^  and  an 
IBM  1800  Data  Acquisition  and  Control  System  using  interface  hardware 
and  software  developed  in  this  laboratory.    The  1800  is  configured  with 
32K  of  core  memory,  a  3-drive  1810  disk  system,  two, magnetic  tape  drives 
•  high  speed  digirtal  and  analog  input  and  output  ^capabilities,  .a  line 
printer,  a  console  typewriter,  'and  a  digital  plotter.    An  analog  tape 
system  has  been  interfaced  to  the  computer  and  to  the  various  low 
resolution  instruments  in  the  laboratot7.    This  provides  the  capability 
for  data  to  be  acquired  and  saved  for  subsequent  digitization  and 
processing  by  the  computer  at-a  later  time.    Finally,  the  most  recent 
addition. is  an  oscil](pscope-microf ilming  system.       ^  4* 

Improvements  in  the  scanner  ctfimitry  to  allow  a  more  repetitive 
computer-controlled  scan  function  and  major  changes  in  the  software 
system  have  permitted  faster  and  more  efficient  use  of  the  facilities 
by  laboratory  chemists^  These  faster  programs  have  resulted  in  ,^he 
increased  use  of  the  GC-MS-Computer  system  for  problems  in  areas 
such  as  lunar  crust  analysis,  carb"6hydrate  chemistry,  peptide 
sequencitig,  analysis  of  drug  abuse  problems,  geochemical  analyses, 
and  various  other  fields  of  organic  and  biochemical  interest.  With 
the  large  amounts  of  data  now  generated,  it  was  rjealized  that  the 
limiting  fac£or  was  the  relatively  slow  speed  of^  the  output  devices 
required  to  present  the  data  to  the  chemist.    Furthermore,  it  is 
desirable^  to  make  all  of  the  data  (not  just  the  spectra)  readily 
available  to  the  chemist  without  requiring  him  to  return  to  the 
computer  for  each  additional'^^iece  of  information.    To  meet  these 
goals,  a  new  and  faster  method  of  output ting  data  was  sought.  The 
microfilm  system  described  bellow  has  efficiently  solved  these 
problems.  '       .         -  " 

The  microfilming  system  CQnsists  of  a  Tektronix  611  storage 
oscilloscope,  a  Bolex'  16ram  H16M5  Cine  camera  with  a  10mm  Switar 
lens,  and  a  single-framing  control  mechanism  manufactured  by  In- 
dustrial Camera  Company,  Epsom,  N.H.    Tbe  plotting  and  filming 
are  totally  controlled  by  the  computer  via  analog  and  digital 
output  features.    The  computer  can  process  and  film  a  GC-MS  run 
of  AOO  spectra  in  under  10  min.  (including  the  time-to-mass 
conversion  step).    The^ilm  is  immediately  developed  using  a 
Kodak  Prostaf  Processor  and  is  thus  available  to  the  cljemist  with- 
in a  few  minutes  after  filming  is  completed.  .The  end  result  is 
a  finished  microfilm       a  carttldge  which  can  be  viewed  on  the 
microfilm  readers  in  1^h±B  laboratory  or  in  any  modern  library. 

The  data  ava^able  consist  then  of  a  few  hundred  consecutively 
recorded  mass  spectra.    Since  a  single  focusing  mass  spectroc\eter  is 
used  the  spect/ura  rej)resents  nominal  mass  (rather  than  elemental 
compositions)  .vs>  fi^bi^ndance  of  the  ions.    These  data  are  then 
presented  in  a  multitude  of  interpretativ^ly  meaningful  ways  (12) 
as  outlined  in  the  following  examples  from  work  on  the  organic  acids 
produced  upon  oxidiation  of  oil  shale  kerogen  (13). 

*  • 
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Sutnmlng  the  abundance  of  all  ions  in  each  spectrum  and  plotting 
the  sum  ¥s«  scan  Index  nuoiber  produces  a  gas  chromatogram  (Fig,  2  top; 
representing  wO  mass  spectra).    The  mass  spectrton  corresponding  to 
any  point  of  Interest  along  the  gas  chromatogram  can  be  presented  (on 
the  scope  or  plotb'ed)  by  the  computer.    For  example,  scan  No.  171,, 
shown  In  Fig.  3,  is  ^Interpreted  as  the  methyl  ester  of  the  acid  with 
JL7  carbon  stomas  (the  formula  Is  not  part  of  the  output,  but  arrived 
at  by  the  Interpreting  chemist,  or  by  the  computer  using  the  compar- 
ison technique  outlined  below).    Alternatively,  one  can  display  the 
relative  abundance  of  a  single  mass  throughout  the  gas  chromatogram, 
resulting  In  a  "mass  chromatogram"  as  shown  in  Fig.  2  for  mass  88 
(middle)  and  9^(bottom),  characteristic  for  a-methy^  substituted 
methyl  esters  Ad  methyl  esters  of  dlcarboxyllc  acids,  respectively. 


A  combination  of  such  display  techniques  and/or^  the  comparison 
of  the  mass  spectra  obtained  during  the  gas  chromatoj^ram  ^ith  a  file 
of  about  10,000  authentic  spectra  led  to  the  identification  of  the 
components  in  this  mixture.    Tliey  turned  out  to  be  unbrahched 
carboxylic  acids  with  8-29  carbon  atoms  (Cg-C^g),  Isoprenoid  acids 
with  15-22  carbon  atoms  (C^-C^2»  where  the  superscripted,  letter 
denotes  the  position  of  the  jElrst  met'hyl  substltuent)  gnd  mgthyl 
esters  of  dicairboxylic  acids  with  7-17  carbon  atoms  (C-  -     ->.  - 

The  comparison  technique  mentioned  above  involves  the  matching 
of  the  significant  features  of  the  spectrum  of  the  unknown  with  each 
one  in  the  collection  and  presenting  the  chemist  with  the  names  of 
the  compounds  exhibiting  the  most  similar  spectra  (lA).    It  is  up 
to  him  to  decide  whether  the  most  similar  one  Is  iqdeed  the  same 
compound  or  merely  the  most  closely  related  one.    The  final  result 
thus  rests  on  the  chemist's  interpretation  of  the  facts  presented 
by  the  computer.* 
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RESOLUTION.  OF  ELECTRONIC  ABSORPTION  SPECTRA  WITH 
,  .    .  .     LOGNORMAL  CURVES*  . 


•D,  B»  Siano,  J^,  A,  Thomson  and'D/E,  Metzld^ 
Department  oV Biochemitstry  and  Biophysics 
Iowa  State  University 

Ames,^  Iowa  '50010  ^      .  . 

INTRODUCTION  .     *  '  .        '  ,  ' 

Tlje  resolution  of  electronic  absorption  spectra 
by  curve-fitting  methods  has  largely  been  overlooked 
while  empl>asis  has  been  placed  on  spectra  in  the 
if-Raman'  region  (1)  •     In  the  \tetter  the  bands  to  be 
resolved  have  generally  been  considered  symmetrical: 
the  mo^t  popular  functions  chosen  for  fitting  in  a 
•least-squares  treatment  have  been, either  Gaussian  or  a 
combination  of  Gaussian  anjJ  Cauchy  (Lorentz)  curves  • 
Even  when  asymmetry  of  the  bands  wa^  recognized^  a 
""two  half-width"  ^Gaussian  was  appl-ied. 

The ^re solution  of.  spectral  bands  in  the  uv  region  - 
can  also  pifoyide  much  chemical  information.     In  addition' 
resolution,  provides  a* means  of  storing  information  cibout 
absorption  spectra  in  an  easily  accessible  library 
of^ uniform  format. 

THE  LOGNORMAL  FITTING  FUNCTION  *  ^ 

Close  examination  of  absorption  spectra  reveals 
that  ih  general,  individual  peaks  are  not  symmetrical.  " 
This  holds  for  spectra  taken  against  wavelength  or 
wavenumber,  but  the  asymmetry  is  more  obvious  on  the 
wavenumber  scale.    The  bands  are  generally  skewed 
towards  higher  energies.     The  so-called  "lognormal" 
distribution  has  beeh  found ^very  satisfactory  for  use 
in  resolution  of  elebtironic  spectra  (2). 

This  function  has  the  form 


e(v)  -  r~  exp  (-cV).exp    (-  ^  [in{^)]^\    ;  v 
v-a  >  i     2c  '  ^  ) 

and  c(v)  =  0;  V      a     •     The  molar  absorptivity,  is 
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a.ifunction  of  the  wavenumber,  v,  in  kiloKaysers  (kK)  , 
e'd )  the  maximiim  molar  absorptivity  of  the  peak  and  of 
the  four  lognormal  parameters  designated  a,  b,  and  c. 

Three  other  parameters,  termed  "practical"  in  the 
sense  they  may  be  estimated  directly  from  recorded 
spectra,  are  defined  in  Fig.  1.  ^  They  are  the  position 
^of  the  mode,  ^q,  the  half-width,  W,  and  the  skewness, 
p.    They  are  related  to  a,  b  and  c  by  the  expressions 

c  =  (jlnp)/(2jln2)^ 

b  =  W[p/(p^-l)]exp(c^) 

a  -  Vq  7  W[p/(p2-l)] 

The  area.  A,  under  the  curve  is 

A  -  Cq  W[(27T)^[p/(P^-iy]c  exp(c^/2)].  . 

For  a  single  spectral  band,  the  lognormal  function 
has  4Deen  found  superior  to  the  "two  half-width"  Gaussian 
and  provides  a  suitable  analytical .expression  for 
describing  many  spectral  bands  (2) * 

CURVE  RESOLUTION 

An  actual  spectrum  will  generally  consist  of        '  . 
several  bands  which  may  overlap  or  be  buried:  the 
problem  of  resolution  then' is  one  of .optimizing  many 
paraiAeters  to  obtain  the  "best"  fit.    Our  prograifts 
use  the  least-squares  approach  and  the  minimization 
technique  of  Fletcher  and  Powell  (3).     the  coding  for 
thii  method  is  available  "in  the  IBM  Scientific 
Sx±>routine  Packages  as  subroutine  FMFP.     To  use  this 
subroutine,  the- user  must  supply  another  subroutine 
(in  our. case,  DERIV)  in  which  the  values  of  the 
function  and  of  thp  error-squared  sum,  are  determined 
from  the  current  estimates  of  the  parameters  anS  in  ' 
which  the  gradients  of  the  parameters  are  also 
calculated.  *  . 

The  digital  .spectrum  (usually  as  molar  absorp- 
tivities)  to  be  resolved  is  read  in  along  with  initial 
guesses  of  the ^practical  parameters  for  each  band  ^ 
which  is  believfed  present.     The  corresponding  val\ies 
of  the  lognormal  parameter^ ^^are  calculated.  Control 
then  parses  to  subroutine  FMFP  which  immediately  calls  ^ 
subroutine  DERIV.     There  the  sum  of  the  difference 
at  each  wavenumber  between  the  experimental  and  computed 
spectrum  is  calculated.    The  computed  spectrum  is  a 
summation  of  the  current  theoretical  values  of  all  the 
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individual  lognormal  bands^    The  derivatives  of  the 
lognormal  with.regpect  to  the  parameters  are  also 
calculatexi  from  the  appropriate  analytical  expressions 
and  are  used  to  evaluate  the  gradients  of  the 
parameters.     The  iterations  continue  in  FMFP  and  DERIV 
until  either  donvergence  is  sensed  in  terms  of  certain- 
\Jser-supplied  values  or  a  given  number  of  iterations  is 
completed,  .  Control  returns  to  the  main  program  and 
the  final  values  of  the  parameters  and  of  the  gradients 
together  witl^,  a  comparison  between  the  observed  and 
calculated  spectrum  are  printed, 

Ni  The  output  also  includes  two  computer-prepared 
plots:     the  first  is  a  trace  of  the  confuted  spectrum 
with  the  actual  data  points  superimposed  and  the • second 
is  a  plot  of  the  differences  as  a  percentage  of  band 
height,    ^  •  . 

This  program  h^s  worked  well/  but  as  the  complexity 
of  the  spectrum  Increases  /  the  program  bect&mes  somewhat 
inefficient.     Experience  showed  gpod  results  wei^e 
obtained  if ^on^y  two  or  three  peaks  were  considered 
at  one  1;ime,  .  Therefore /•  the  program  was  modified  to 
consider  only  part  of  the  spectrum  at  a  time.    The  new 
program  employs  a^  "sliding-window"  to  control  the  number 
of  peaks/  parafaeters  and  segments  of  the  spectrum  con- 
sidered.   The  lowest-energy  peak  which  contains  at 
least  one  parameter  to  be  adjusted  is  located  and  this 
serves  as  the*  "l?ase-peak'*-  of  the  window  for  this 
particular  sequence.    The  relationship  of  this  peak 
to  its  two  higher-energy  neighbors  is  examined  and  a 
window-width  of  one,  two  or  txiree  peaks  determined. 
Then  only  the  parameters  to^ actually- be  varied  are 
passed  to  £he  minimization  subrovitines.     Only  the 
portions  of  the  spectrum-  within  the  window  are  made 
available  to  the  derivative  subroutine. 

Upon  return  to  the  main  ptogram,  the  adjusted 
parameters  of  the  base  peak  are  fixed  so  they  remain 
constant.     In  further  calculations  any  significant 
•contribution  of  this  peak  and  similar  "fixed''  peaks 
to  the  spectra  within  the  window  are  considered  in 
the  theoretical  summation.    The  program  now  moves  the 
window  to  a  new  base  peak  at  higher  energy.  The 
iterations  continue  until  all  peaks  have  been  con- 
sidered.   The  greater  complexity  of  this  coding  is 
more  than  offset  by  the  increased  efficiency  in 
subrouti/ies  FMFP  and  DERIV, 
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EXAMPLES  .      '    •  /  , 

Potassium  Iodide.    The  peak  of  the  charge-transfer- 
to-solvent  spectrum  of  aqueous  iodide  ion  at^  44*2  kK  has 
been  fitted  with  a  Gaussian,  and  the  upper  portions  have 
been  tested  and  found  symmetric  (4) .     However,  an 
excellent  fit  with  slightly,  skewed  lognormal  curves  is 
obtained  as  shown  in  Fig.  2.     The  maximum  error  is  ' 
0.7%  of  the  band  height  and  this  occurs  in  the  yalley 
at  48  kK.    The  skewness,  p,  is  1.05  for  both  bands. v 
Although  the  sk^wness  is  small  it  is  easily  measurable  ^ 
(for  the  44.2^  kK  band)  and  it  is  clear  that  the  l^and  is 
not  truly  Gaussian.     For  the  Gaussian  p  =f  1,  and  it  ,  can 
be  shown  that  as  p      1.0,     the  lognormal  approaches 
the  Gaussian  (2) . 

5-peoxypyridoxal .    An  example  of  a  more  complex, 
spectral  resolution  is  shown  in  Fig.  3.     A  number  of  ^" 
overlapping  peaks  have  been  resolved  successfully,  but  , 
this  was  possible  only  by  assuming  values  of  W  and  p 
for  some  of  the  bands  (based  on  comparisons  with  related 
substances)  and  fixing  these.     The  figure  illustrates 
another  feature  of  the  program,  the  tracing  of  individual 
lognormal  bands. 

3-Hydroxypyridine .     Resolution  of  spectra  can  be 
used  to  advantage  to  describe  effects  of  changes  in 
solvent  or  temperature  on  spectra.     Resolved  spectra 
of  3-hydroxypyridine  in  water  and  in  30%  methanol  are 
shown  in  Figs.  4  and  5.    As  indicated  in  Fig.   4,  the 
neutral  form  of  the  molecule  exists -as  a  tautomeric 
mixture,  the  equilibriym  position  of  which  is  strongly 
influeaced  by  solvent  composition.    This  shift  in 
equilibrium  is  reflected  clearly  in  the  spectrum. 
The  first  (low  energy)  band  (band  I)  represents 
the  zwitterion  (z)  while  band  II  represents  *  the 
uncharged  form  (n) . 

Much  experimental  work  in  this  laboratory  (5) 
indicates  that  molar  band  areas,  d^,,  remain  constant, 
with  changing  solvent  composition  (and  temperature) . 
This  beiny  the  case,  the  tautumerization  constant, 
K^,  in  a  solvent  may  be  evaluated  from  curve  resolution 
data  using  the  following  equation: 

_    z  _      z^     z      _  ^     ^n  .  z 

^        a  /a-     ^  t,a  a 

n^     n  z  n 


Here  f^  and  fj^  are  the  fractions  of  zwitterion  and 

uncharged  forms,  respectively;  fg  +  f^  =  1.  Cl^  and 

Q^i         the  measured  areas  of  bands  I  and  II,  respectively. 
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FIGURE  3 

RESOLUTION  OF  A  COMPLEX  SPECTRUM 
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FIGURE  4 


RESOLUTION  OF  THE  SPECTRA  OF  A  TAUTOMERIC  MIXTURE 
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FIGURE  5 


DIFFERENCE  PLOTS  OF  RESOLUTIONS  OF  FIG.  4 

Peaks  maxima  are  indicated  by  arrows. 
The  bottom  curve  is  fot  the  spectrum  in  0%  piethanol. 
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and'A^    and  t^O^  are  the  changes  in  those  areas  when 
the  equilibrium^   position  is  perturbed^  e.£.  by  a 
change  to  30%  methanol.    Prom  the  data  of  Fig.  4 
we  calculate 

=  0.76  *±  O.L  in  water  at  25^C.  / 

f 

The  misfit  in  both  spectra  at  44  kK  Ts.  pbvious, 
but  its  cause  is  less  so.    We  have  deliljeratSly  •  ^  ^ 
^weighted  the  points  in  the  valleys  only  oile^-tenth  as 
heavily  as  the  points  around  the  peaks.     This  insures 
a  good '^f it  around  the  peaks.    Since  for  soin^  confounds 
we  obtain  good  fits  in  the  valleys  also^  it*xs  possible 
that  an  additional  weak  absorption  band'  is  present  in 
the  spectra  of  Fig.  4.  •    -  ^     ^  .  * 

The  goodness  of  f it *ia  graphically  shown  in. Fig.  5 
which^^s  a  plot  of  the  percent,  etror  of  the  two 
3-hydroxypyridine  spectra.    The  peak  positions  havdk 
been  marked  by  small  arrows.    The  sraall  shifts  in  ' 
b^nd  positions  induced  by "the  solvent  change  are 
readily  agjparent  and  there  is  a  possibility  that  som$ 
of  the  pattern  seen  arises  from  the  existence  of 
characteristic  vibtonic  fine  structure.    This  seems 
especially  likely  around  peak  11^  for  a  recurring  similar 
pattern  is  seen  in  many  spectra.  *  ^  -  ' 

Benzene.    The  fine  structure  of  the  low  ener^ 
band  of  benzene  in  methanol  is  clearly  displayed  by    '  * 
fitting  a  lognormal  curve  and  ex2u:tiining  the  difference 
plot  (Fig.  6).    This  approach  can  also  be  used  to 
.display  fine  structure  in  spectra  of  proteins  or 
nucleic  acid  and  a  way  of  easily  examining  effects, 
of  perturbations  by  solvent  or  teit^erature  chcinges. 

DATA  COLLECTION  AND  PROCESSING 

,  ,  The  spectra  in  this  paper  were  collected  in 
digital  form  on  cards  punched  at  regular  wavenumber 
intervals.    A  Cary  1501  spectrophotometer  was  equip'ped 
with  a  Datex  digitizing  syste\n  which  is  interfaced  to 
an  IBM  026  key  punch.    The  absorbance  information  is 
sensed  by  a  shaft  encoder  by  specified  "wavenumber 
interval  commands"  and  is  punched  on  the  cards  in  a  .  ^ 
coded  format.    A  number  of  programs  have  been  written 
to  process  the  raw  data.  • 

Initially  the, Datex  code  is  converted  to  "real" 
absorbances  and  baseline ^  and  (if  required)  .turbidity 
corrections  are  made.    The  program  allows  great  varia- 
tion in  spectra-baseline  combination^ including 
averaging  6f  repeated  scams.    The  spectra  may  terminate 
at  different  high  energy  wavenumbers^.    Small  adjustments 
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for  instriamental  drift  may  be  made  automatically.  At^i^ 
temperature  corrections  are  applied^  input  concentratipns 
may,  be  used  to  produce  decks  of  apparent  molar  &bsorp- 
tivities.    These  decjcs  are  then  analyzed  by  other  progreuns'^ 
to  extract  equilibrium  constants  and  spectra  of  individual* 
ionic  forms  of  the  compounds  (6)  .    The  progreuns  feature 
the  use  of  graphical  output  emd  comparison 't?f  experi-  ' 
mental  and  calculated 'data. 

A  manual  describing  the  complete  package  of  programs 
is  available  on  request.    All  programs  are  written  »f6r 
the  IBM  360/65  at  lowa^  State  University  in  FORTRAN  IV 
usin^this  installation '.s  platting  progreim  SIMPLOTTER 
on  a  CALCOMP  470  plotter." 
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REAL-TIME  ANALYSIS  OF  ULTRAVIOLET  ABSORBANCE  SPECTRA  0^*  ONE 
TO  FIVE  DNA  SAMPLES,  USING  CLASS >  A  SPECTROSCOPY -ORIENTED 
LANGUAGE  ON  AN  8K  MINI-COMPUTER 

Davi.d  C,  Harris 
Institute  of  Molecular  Biology  and  Departmelat  of  Chemistry 

University  of  Oregon 
^  Eugene,  Oregon,  97403 

Oveirview  ^ 

The  absorbance  of  ultraviolet  light  at  260  nm  has  long 
been  used  to  study  the  condition  of  double  stranded  DNA  and  its 
interactions  witTi  other  molepules.    But  existing  theory  indicates 
that  different  nucleotide  sequences  should  have  characteristic 
spectra,  and  that  if  one  knew  the  spectruip  for  each  possible 
interaction,  he  might  deriVe  a  we^alth  of  information  fr^>m 
analysis  of  accurate  spectral  measurements.    Rather  short 
nucleotide  sequences  .are  being  studied  by^  circular  dichroism  to 
elucidate  the  sources  of  observable  spectra*  (Cantoir  et  al« ,  , 
1970;  Warshaw  and  Cantor,  1970)*    We  are  worki^ig  on  relatively, 
large  DNA  molec*ules,  u^lng  an  empirical  but  theoiretically 
reasonable  method'  to  analyze  the  spectra.    We  have  developed  ^ 
system  using  an  8K  Varian  620/1  comput;er  .and  the  'striftg-processor 
language  CLASS  to  control  the  position  of  a  fiye-holder  sample 
turret,  alter  the  teiiiperature,  and  initiate  scaiis  automatically' 
on  a  Gary  14.    We.  read  percent' transmittance,  time,  and 
temperatyre  to  high  accuracy,  puticb  a  paper  tape  report  of  the 
spectrum,  arialyze^the  measurements"  m'ade,  and  report  the  most' 
useful  results  ^  we  can  act  ^to  chknge  the  course  oi  the 
experiment  if  desired'.    The  report  can  ,be  irt  characters  on  a-  ^ 
teletype,  j^raphjlcally  on  a  plotter,  or  on  a  storage  oscilloscope.  * 
The  information  reported  may  easily  be  varied  from  experiment;  to 
experiment  as  our  chemical  questions  thange.    After  an  experiment 
we  transcribe  the  paper  tape  informatidn  to  magnetic  tape  and 
analyze  ±^  more  thoroughly  on. an' IBM  360/50  II,  whloh  has  greater 
storage  capacity^ and  sophisticated  plotting  facilities. 

1  wish  to  acknowledge  that  Dr.  William' Melchior^  James  McGhee, 
Loren  Dotsen,  and  Chris  Meyers  did  the  major  share  qf  the 
hardware  development-  descioed  in  this  paper.    I  am  grateful 
to  Dr.  Charles  Klopfenstein  for  advice  and  instruction  in 
programming  an^* instrumentation  techniques,  «nd  to  Dr.  Peter  H. 
von  HippeL  for  generous  supp.ort  and  guidance  in  the  physicJal 
chemistry  of  DNA*    During  the  time  this  research  was  in  progress 
my  stipend  was  provided  by  NIH  .Training  Grant"^ PHS2T0 100715-12,  . 
research  support  was^provided  by  grant  PHS5R01GM15792'-05  (to 
P.  von  Hippel),  and  ba^ic  computer  facilities  were  provided 
by  PHS~HSAA5S04tFR06027--02. 
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Theory 

Briefly,  the ^spectral  aflalysis  method  we  are  using 
(Felsenfeld  and  Hirschman,  1965;  Hirschman  and  Felsenfeld, 
1966)  assumes  that  hypochromism  in  DNA  arises  from  interactions 
between  base  pairs  in  a  double  helix.    Thus,  an  adenine-thymine 
(AT)  pair  next  to  a  guanine -cytosime  (GC)  pair  has  one  spectrum, 
AT  next  to  AT  a  different  spectrum,  and  so  forth.    If  the^arious 
possible  combinations  are  treated  as  randomly  ordered  along- the 
helix,  one  can  deal  with  all  interactions  as  being  due ''to^three 
kinds  of  "average"  interaction:  AT  next  to  AT,  GC  next  to  GC, 
and  XT  next  to  GC,  •  Since  randomness  is  assumed,  if  one  knows 
the  fraction  AT  in  a  simple  he  can  calculate  the  fraction  of 
each  kind  of  interaction  present.    Felsenfeld  and  Hirschman  als6 
introduced  a  parameter  delta  to  repr.esent  small  deviations*  from 
randomness,  so  that  a  positive  delta  indicates  AT  pairs  being 
neighbors  to  AT  more  often  'than  predicted  by  complete  randomness. 
They  assumed  that  in  a  variety  of  DNA  samples  the  variatiotis  in 
degree  of  randomness  would  cancel  out.    That  is,  delta  is  zero 
overall.    They  measured  the  spectra  of  11  kinds  of  DNA,  of 
various  fraction  AT,  and  used  least  squares  curve  fitting^  to 
determine  the  spectra  of  the  three  "average"  interactions.  This 
was  done  for  the  helical  native  DNA  and  for'  the  same  samples 
when  completely  denatured.    The  numbers  measured  were  incorporated 
into  a  set  of  convenient  algorithms.    One  measures  4-15  optical 
densities  on "a  sample  under  the  appropriate  conditions,  and 
calculates  the  total  concentration  of  nucleotides,  the  fraction 
AT,  and  the  estimate  of  randomness,  delta.    This  can  be  done  on 
a  natiye  sample  or  a  fully  denatured  saniple,  and  the  method ^has 
frequently  been  used  to  establish  concentration  and  nativeness 
of  DNA  samples,  usii^  manual  or  desk  calculator  computations. 
One  can  also  determine  the  absorbance  changes  produced  by  partial 
denaturation,  and  use  them  to  compute  the  number  of  base  pairs 
which  denatured  since  an  experiment  began.    Thus,  one  can  follow 
the  melting  of  AT  and  GC  pairs  separately,  even  though  they  are 
present -in  the  same  molecule.    We  are  using  this  separated 
melting  curve  technique  to  study  the  effects  of  various  aqueous 
solutions  on  the  two  kinds  of  base  ^air.    A  iSl^lecule  that  binds 
preferentially  to  regions  high  in  AT  pairs  may  stabilize  this 
region,  and  we  will  see  the  effect  as  a  higher  melting 
temperature  for  the  AT  curve.  .  . 

\ 

Hardware  '  (  \ 

Figure  1  shows  the  basic  contponents  of  our  system.  The 
central  object  is  our  Spectrophotometer  Interface  (SIF),'a 
modified  Gary  Instruments  option.    The  SIF  is  contrdlled'^-^s  two 
separate  devices.    The  relay  driver  controls  a  box  of  power  relays 
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We  use  some  relays  to  control  an  Automatic  Sample  Changer.  Our^ 
five-sample  turret  is* modified  to  permit  reversing* the  direction 
of  turn  so  as  not  to  wind  up  the  thermocouple  which  goes  into, 
one  cvv^tte..   Other  relays ^control  a  modified  temperature 
programijier •    This  programmer  drj^yes  the  thermostat  on  a  heating 
bath  which  continually  circulates'  ethylene  glycol  to  the  *       ^  ^ 
multiple  sample  tu^rret.    We  considered' controlling  the  heating 
coil  in  the  'bath  from  the  computer  but  decided  the  bath  dqaign' 
probably  would  not  allow  us  increased  stiibility  under  direct 
computer  control.    The*  refrigerator  was  wired  to  allow  on-off 
and  high  or  low*,  cooling  rate  selection.    Finally,  we  use  the 
relay  box  to  statrt  the  Cary  Repetitive  Scan^option  that  drives  . 
the  monocbromator  over  a  mechanically  set  range  of  wavelengths. 
The  wavelength  counter  in  the  SIF  keeps 'track  of  the  monocbromator 
mechanism  to  O.Oiiim.    A  portion  of  th^  amplified  phototube  signal 
goes  to  one;t^rmin^l  of  a  multiplexed  ADC  with  selectable  gain. 

The  mecKaijical- drive  of  the  moivochroiftatOT  cannot  stop  * 
precisely  at  a  particular  wa<relength.    So  we  begin  measuring 
percent  transmittance  a  little  above  e^ch  desired  point  and 
average  all  readings  (at  the  rate  of  3D  readings/second)  until 
the  monocbromator  is  an  equal  distance  below  the  intended 
.^^av^length.*    Increasing  the  width  of  this  "wavelength  filters- 
reduces  .noise  effects  by  increasing  the  number  of  readings - 
^averaged*    Except  on  an  absorption  plateau,  an  increased  f ilter  i 
width  -also  includes  more  re^adings  unrepresentative  of  the^  y 
meastirement .    We' currently  use  ±  1  nm,  which  allows  repeated 
scans  t^en  at  2.5  nm/sec  scanning  rate  to  have  a  corrected 
standard  deviation  of  0.001  O.D.  units  for  readings  below  an 
optical  density  of  1.    We  collect  such  readings  every  5  nm  over 
th&*  range  350-220  nm.    We  can  measure  temperature  to  about  0.1 
degree  Q  uaing  the  thermocouples  and  calibration  tables  stored 
in  the  computer.    We  routinely  measure  time  to  1/10  second  wifh 
the  real-time ■ clock.    We  , can  punch  paper  tapes  on  a  teletype 
punch*  at.  10  characters /sec  or  on  a  fast  punch  at  120/sec. 
Aft^r  analyzing  a  spectrum  we  report  the  results  -on  a  teletype 
or  stbrage  oscilloscope.    We  did  simple  wiring -to  allow  the  DAC 
^nd  relays  insj,ae  the  SIF  to  cdntrol  an  dnalog  strip  chart 
rec'order.    Thus  Ve  can  create  simple  graphical  plots  while  an 
experiment  occurs*. 

Software  ' '  «•  ' 

CLASS*  (Conversational  language  for  Spectroscopic  Systems)  ^ 
is  Cary  'Instruments*  st;ring  processor  language  for  their  group 
o.f  spectroscopic  instruments.    Since  this  language  is  rather  ^ 
^^different  from  languages  commonly  used  in  batch  calculations  on 
large  computers,  we  will, 'discuss  it  here*    Fig^^e  2  is  an  abstract 
schematic  of  the  language,  and  figure  3  is  a  representative 
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program  segment.    About  3K  of  the  memdry  is  occupied  by  the  CLASS 
run  monitor  and  related  subrputines.-    Programs  are  entered  by 
typing  commands  which  design^^e  particular  operations •  These 
commands  are  stored  mainly  "as. : a.  string  of  8  bit' bytes.  During 
execution  the  run  monitor  g^ts  a  byte  and  transfers  control 
temporarily  to' whichever  routiite 'does  the  desired  operation. 
That  much  is  rather  like  a  S|&cohd  level  of  machine  language.  An 
important  feature  is'<  that  the  contents  of  memory  are  "floated". 
That  is,  information  is 'moved  up  and  down  in  memory,  and  some 
addresses  are  altered  to  allow,  insertion  and  deletion  of  routines, 
prpgrams,  and  data.    A  "transfer  vector"  of  addresses  is 
maintained,  so  one  can  get  to  .any  desired  routine  by  doing  an 
indirect  jump  through  the  tratrv^fer  -vector.    This  strategy  assures 
complete  utilizatioh  of  memotfy  space.    By  limiting  the  number  of 
operation  routi»nes  to  fewer  than  150,  and  using  bytes  for  coding,. 
CLASS  makes  program  code  very  cj&rapact.     CLASS  saves  more  space  by 
frequently  using  the  same  opet^tion  routines  to  manipulate 
spectral  measurements  and  program  instructions.    After  all,  both 
are  }.inearly  arrange4  and  sequentially  accessed*    Because  there  is 
no  compile  step,  CLASS*  functioiis  conversationally 'and  program 
modification  is  fairly  easy.  ' . 
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CLASS  has  a  set  of  "macro"  commands  for  tha  most  common 
operations,  lik^  taking  a  spectrum,  subtracting  a  baseline, 
printing  a  label,  etc.    Naming  a  macro  causes  the  appropriate 
bytes  ,to  be  moved  from  a.  macro  library  into  the  program* 
string.    Macros  being  moved  can  request  the  programmer  to  enter 
the  particular  values  needed  for  parts  of  the  program  that  are 
variable,  such  as  the  wavelengths  to  be  scanned,  where  to 
store  the  readings,  etc,    A  programmer  can  write  useful  macros 
for  his  own  common  operations  and  put  thewn  in  the  macro  library 
using  a  macro  compiler  feature.    Recent  versions  of  CLASS 
have  added  multiple  program  strings,  inter-  and  intra-string 
'conditional  jumps,  byte  manipulations  and  floating  point 
arithmetic.    The  jumps  allow  us^to  write  program  string 
subroutines.    The  various  levels  of  subroutine  allow  us  to 
write  highly  modular  code  for  easy  modification.  Since 
modularity  increases  the  possibility  that  something  \}±11  be 
missing  when  execution  is  attempted,  CLASS  provides  a  number 
of  error  messages*  which  often  indicate  what  was  wrong  or 
missing,    CLASS'  interactions  with  the  various  instruments  are 
isolated  in  a  few  operation  routines  that  read  or  drive  the 
peripheral  hardware.    Most  I/O  occurs  through  th^  registers  and 
the  push-down  stack.    The  stack  is'  also  where  all  arithmetic  is 
done,  using  Polish  notation  sequences  of  operations  on  the  top 
two  locations, 

CLASS  has  a  featCire  to  make  handling  spectra  convenient: 
the  sentence,  '  A  sentence  is  all  program  code  between  two 
periods  (,),    If  a  sentence  contains  any  operation  that  should 
be  applied  to  every  datum  of  a  spectrum,  such  as  putting  the    -  « 
sum  of  two  spectra  into  a  third  data  array,  the  run  monitor 
tluses  the  sentence  to  be  reexecuted  once  for  every  point  in  the 
spectrum.    The  sentence  is  somewhat  like  the  FORTRAN  BO  loop,  but 
the  programmer  need  not  know  the  number  of  points  in  the  spectra  ' 
being  handled. 

The  main  price  of  these  advantages  seems  to  be  speed, 
particularly  in  arithmetic  calculations.    For  most  chemical 
instruments  this  is  no  problem.    Since  we  do  extensive 
arithmetic  in  the  spectral  analysis,  we  occasionally  see  delays 
due  to  calculations.    Our  monochroihator  mechanism  takes  a  piinute 
to  rewind,  jso  we  have  sufficient  time  for  all  calculations,  I/O, 
and  control  operations.    After  we  collect  and  store  baselines, 
a  typical  program  might  ^1)  position  the  desired  cuvette  in  the 
light  path,  2). start  the  monochromator  and  measure  time,  percent 
transmittance,  and  temperature,  3)  punch  a  paper  tape  of  the 
b^sic  measurements,  4)  increment  counters  of  spectra  taken, 
5)  convert  percent  transmittance  to  absorbance,  6)  adjust  for 
the  effect  of  solution  expansion,  7)  subtract  a  baseline,  8) 
analyze  the  spectrum  and  store  the  results,  9)  compute  and 
report  the  desired  function  of  the  basic  results,  10)  control 
the  bath  "temperature  and  11)  increment  the  cuvette  position 


counter  before  retuming'to  step  1).    The  systenj  can  run 
unattended,  but  we  attempt  to  watch  for  unexpected  results  that 
require  further  observation  or  a  program  change.    Figure  4 
represents  the  kind  of  printed  report  we  can  produce. 
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The  third  line  of  the  report  is  the  result  of  the  two  term 
analysis,  a  computationally  simpler  approximation  to  the  full 
spectral  analysis. 

Figure  4 


For  the  Future 

Our  current  system  has  two  distinct  weaknesses^  all  the 
paper  tape  is  bulky  and  inconvenient,  and  the  large  computer 
processing,  a  straightforward  batch  calculation,  does  not  occur 
until  •after  th6  experiment  is  finished.    Dr.  Klopfenstein  of  * 
the  Oregon  Chemistry  Department  has  discussed  the  possibility 
of  a  "data  sink"  system.    One  computer  with  a^'magnetic  disk 
unit  would  be  centrally  located  and  connected  by  permanent  wires 
to  terminals 'in,  most  rot>ms.    We  could  connect;  to  the  nearest 
terminal  and  send  spectral  readings  to.  the  "sitik^^^much  faster 
than  paper  tape  can  be  punched.-   Then  we  would  dump  our  readings 
from  the  disk  onto  a  magnetic  tape  compatible  with  the  IBM  360. 
This  may  never  occur  because  of  an  evan  more  promising 
possibility:  mini-computer  communication  with  a  recently 
acquired  time  sharing  PDP-10  computer.    We  will  not  need  paper 
or  magnetid  tape  when  we  can  send  information  to  this  Targe 
cpmputer.    Such  a  system  would  make  elaborate  analysis^ feasible, 
with  the  results  returning  in  time  for  experimental  control. 
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Applications  of  Computers  to  Structure  Determination  and  Kinetics 


Introduction 


Walter  C*  Hamilton 


Chemistry  Department, 'Brookhaven  National  Laboratory^ 
Upton,  New  York  I1973I 


Westheitoer  has  defined  the  three  main  areas  of  concern  in 
chemistry  as  synthesis,  structure,  and  dynamics.    During  this 
conference,  we  have  heard  of  the  many  contributions  that  computers 
have  made  in  each  of  these  areas.    The  papers  in  this  Session 
represent  a  few  of  the  many  existing  applications  to  the  latter 
two  areas — structure  and  dnnamics.    In  fact,  most  of  the  papers 
±n^  the  other  sessions  are  directed  toward  these  areas  in  one  way 
or  the  other.    Today's  papers  provide  examples  of  each  of  the 
several  ways  in  which  computers  have  had  an  impact  on  chemistry 
and  on  the  ways  of  doing  chemistry:    large-scale  digital  computa- 
tions, on-line  experimental  control,  and  chemist-computer  inter- 
active coupling. 

.  The  use  of  computers  in  structure  determination  work  has  grown 
in  many  areas,  but  the  most  dramatic  effect  has  perhaps  been  .in 
crystallography.    Because  of  his  great,  easily-formulatefl  needs,  the 
crystailographer  was  the  first  major  user  oi^he  modern  digital 
computer.    It  is  no  accident  that  one  of  the  authors  of  F0KrRAN  was 
a  crystailographer  aild  that  former  crystallographers  often  hold 
important  positions  In ^computer  systems  developments    Our  first 
invited;  paper  in  this  session  describes  something  of  the  impact 
that  the  computers^  have  had  in  this  field  and  of  how  the  chemical 
crystailographer  uSes  the  computer.    The  reftiaissance  in  inorganic 
chemistry  and  the  r^gid  propagation  of  the  ^aith  in  molecular 
biology  have  both  been  largely  due  to  the  kind  of  crystallography  ^ 
that  computers,  have  made  possible., 

»      -         '  '  .  ** 

Following  Professor  Ibers*  Contribution,  we  have  two  short 
papers  on  qufte  different  applications  of  computers  to  structural 
investigation.    The  first  contributed  paper  describes  a  correla- 
tion between  optical  spectra  and  stereochemistry  which  has,  been 
explored  by  model  computer  calculations.    The  speed  with  which 
the  computer  can  explore  and  test  hypotheses  in  a  multiparameter 
space  is  here  the  k^y  to  its  application. 

The  second  contributed  paper  shows  how  computers  have  been 
used  for  data  acquisition  and -experimental  control  in  microwave 
spectroscopy,  one  of  the  many  fields  where  the  computer  has 
become  an* important  piece  of  chemidal  equipment. 

We  continue  to  explore  the  use  of  computers  in  experimental ^ 
control  and  man-machine  interaction  as  we  turn  to  the  invited  paper 


in  chemical  kinetics  by  Professor  Wilson..    Both  kinetic  experiment 
and  theory  have  benefitted  from  the  use  of  the  computer,  and  in 
the  two  ^contributed  papers    we  hear  of  the  use  of  computers  in  the 
analysis  of  experimental  rate  data. 

It  is  also  worth  noting  that  Professor  Wilson  and  others 
have  contributed  greatly  to  the  success  of  this  conference  and 
to  the  educational  use  of  computers  by  their  elegant  computer- 
produced  films  of  reacting  atoms — real  kinetic  art. 

The  important  point  about  the  use  of  computers  in  both  of 
these  areas  is  that  more  ch^ists  are  able  to  do  more  interesting 
chemistry  because  the*computer  can  do  many  things  much  faster 
than  can  a  chemist — hopefully  leaving  the  chemist  mpre  time  to 
think  about  the  meaning  of  his  .result;^- 


THE  USE  OF  COMPUTERS  IN  CRYSTALLOGRAPHY' 

♦ 


James  A.  Ibers,  Department  of  Chemistry 
Northwestern  University,  Evanston,  111.  6o20l 


Introduction 

For  the  purpos^^s  of  this  paper  I  mean  by  the 
term  "crystallography"  the  determination  of  the  struc- 
tures of  crystalline  solids  based  on  data  obtained  by 
diffraction  methods  from  single  crystals. 

The  general  technique  of  single-crystal  struc- 
ture determinations  has  undergone  a  marked  increase  in 
importance  in  the  last  decade.    A  glance  at  any  number 
of  chemical  journals,  for  example  ^Inorganic  Chemistry, 
will  reveal  that  this  technique  is  at  the  heart*  of  per^ 
haps  15-50^  of  the,  papers  in  a  given  i^sue.    Moreover,  - 
If  one  follows  the.  trends  that  have  o6cur3:ed  over  the  . 
last  ten  years  it  is  clear  that  more  people  are  now 
doing  larger  structures  more  accuratelv.    There  are 
two  main  reasons  for  these  changes:  (1)  the  aVail- 
al^ility  of  fast,  number-crunching  computers  and  (2) 
better  Instrumentation.    In  the  'instrumentation  area, 
^the  crystallographer  is  making  extensive  use  oT  small>. 
control  computers,.    These  are  thus  the  two  aspects  of 
computers  in  crystallography  that  I  wii;L  discuss. 

Number  crunching 
*# 

Turning  first  to  straight  number  crunching,  the 
crystallographer  Envisions  calculations  that  will  tax 
most  any  available  computer.    His  calculations  are 
generally  unsophistidated,  well-behaved,  but  exceed- 
ingly arduous  -  Just  right  for  the  big  computer. 
Without  describing  .in  any  detai],  how  the  crystallog- 
rapher determines  crystal  structures,  one  can  state 
simply  that  the  process  may  be  tivided  into  several- 
parts:  '    *  !^      '  . 

1)  Data  reduction. 

2)  The  trial  structure 

3)  Structure  r^inement 

In  a  typical  determination  the  crystallographer  will 
collect  between  1,000  and  5.000  data.   ^These  will  be 


reduced  in  a  straightforward  manner  by  corrections  for 
background,  instrumental  factors,  etc^    This  type  of' 
calculation  is  routine  and  rapid.    It  is  at  this  stage 
that  the  data  will  be  entered  into  the  big  computer 
through  cards,  paper  tape,  or  magnetic  tape>  and  in 
most  installations  from  then  on  the  data  will  be  kept 
either  on  magnetic  tape  or  on  disc  files.    It  is  nei- 
ther practicable  nor  economical  to  use  card  in/card 
out  procedures  when  there  are  5,000  data  ppints. 
Moreover,  because  of 'the  large  number  of  data,  the 
calculations  in  crystallography  are  not  well  suit-ed 
to  remote  terminals  unless  high  capacity  linkages  are 
available . 

The  finding  of  the  trial  structure  may  require 
^considerable  non-routine  computing  in  which  hdiTfan"  de- 
*cision  making  is  necessary:    But  this  area  has  devel- 
oped significantly  in  the  last  decade,  and  there  is' 
the  definite  possibility  that  many  structure  deter- 
minations will  be  handled  in  an  increasingly  automatic 
manner  in  the  next  few' years.    Among  the  calculations 
involved  in  the  finding  of  a  trial  structure  are 
three-dimensional  Fourier  summations,  where  the  re- 
sultant function  is  evaluated  at  perhaps  50,000  points 
based  on  .a  summation  over  the  1,000  to  5,000  data 
points.    It  is  interesting  to  note  that  this  type  of 
calculation  required  about  4o  man-hours'  in  1950.  To- 
day it  can  be  done  in  a  minute  or  two  on  a  CDC-60OO 
class  computer.     In  addition  the  results  can  readily 
be  plotted  on  a  graphics  device.  ' 

Refinement  of  the  structure-  ufees  up  most  of*  the 
computing' time  necessary  for  the  determination -of 
crystal  structures.    Generally^  except  for  the  big- 
gest structures,  refinement  is  effected  through  ^a  non- 
linear least-squares 'procedure  in  which  one  attempts- 
to  adjjust  m  .parameters  that  define  the  structure, 
based  on  tFe  agreement  between  calculated  data  points 
and  the  ri  data  points  obtained  experimentally.  Such 
a  procedure  works  if  one  is  sufficiently^  close  to  the 
correct  solution  to  solve  successfully  for  jbhe  correc- 
tions in  the  initial  estimates.    The  calculation  re- 
quires the  inversion  of  an'jn  x  m  matrix.    By  the  nature 
of  the  problem  this*  matrix  is  real  and  symmetric  and 
hence  ^he  elements  can  be  stored  in  m(m-fl)/2  cells  in 
the  computer.     It  is  instructive  to  see  how;this 
function  varies  with  m:  s 


m 


m(m'flV2    '     Type  of  Problem 


50  1^275  ^    rmall.  organic  molecule 

200  20^100  typical  coordination  compound 

500  125,256  120  atom  problem 

5000  12,502,500  1200  atom  problem  (enzyme?).' 
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Thus  one  can  appreciate  thai  since  most  available  com- 
puters have 'only  32K  or    64Kwords  of  high-speed  core 
a  severe  restriction  is  currently  placed  on  the  re- 
finement of  large  structures.    Thus  far,  few  struc- 
tures with  m  past  500  have  been  properly  refihed.  And 
one  simply  does  not  refine  large  biological  molecules 
in  this  manner.    Yet  the  desire  to  do  so  remains, 
and  crystallographers  look^ forward  to  computers  with 
virtually  unlimited  high-speed  cores. 

Can  One  invert  such  large  matrices  to  obtain 
meaningful  answers?    Yes,  because  the  matrices  are 
well  conditioned^    It  has  even  proved  possible  to 
invert  these  large  matrices  using  the  very  short 
word  of  the  IBM  5^0  series  computers. 

Where  does  most  of  the  time  go  in'  these  refine- 
ments?   It  may  be  of  some  interest  to  note  that  the' 
rate-determining  step  is  not  in 'the  matrix  inversion, 
but  rather  in  setting  up  the  matrix.    The  ,operatiort 
involved; is  of  the.  type 

A  =  A  +  B*C 

and  this  operation  must  be  performed  nm^  times.  'On 
the  fastest  available  computers  such  an  operation 
might  require  about  1  ^isec^    If  one  has  n  =  5^000 
and  m  =•  "500  the  time  for  .setting  up  the  matrix  is 
500  sec,  a  long  time^pn  the  very  fastest  computers. 
If  the^time'for  the  crperation  is  lO  ^isec,  the  cal- 
culation will  take  ^t-^east  5/000  sec    of  central 
processor  time.    In  a  typical  calculation  on  the 

'CDG-6600,  a  refinement  inveftving  n  =3000  and 
m  =5  200  will  require  "pe.rhaps  500  sec. 

It  is  clear  that  the  crystallographer  not  only  . 
needs  computers  with  far  more  high-speed  memory 
but  also  ones  that-  are  much  faster.    The  general 
rule  has  been  that  'the  faster  the.  computer,  the  more 
calculations  one  can  perform  per  4ollar.  Although 
it  is  difficult  to'give  an'average  figure,  it  is 
probable  that  most  of  the  well-refined  structures 
reported  in  Inorganic'  Chemistry  involve  about  5K^ 
of  computing  per  structure.    Thus  if  th?  si^ngle-  - 
crystal  structure  determination  is  going  to  become 
generally  feasible  as  an  analytical  technique  it  , 

.will  have  to  await  the  next  generation  of  faster, 
and  cheaper  per  calculation,  computers. 

Control  aspects  of  small  computers  /  . 

In  the  acquisition  of  data  the  crystallographer 
Jbasically  performs  a  simple  experiment.  He  orients, 
his  single  crystal  with  respect  to  the  incoming  and 
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and  outgoing  radiation  beams.   ^To  do  this  he  drives 
three  or  four  motors  to  precomputed  positions.  -He 
then  performs  some  type  of  measurement,  perhaps  over 
an  angular  range  or  for  'a  fixed  time.    These  opera- 
tions are  ideally  suited  to  automation,  and  to  con- 
trol by  a  small  computer.    Within  the  last  four  years 
a  number  of  commercially  available  instruments  have 
come  on  the  market  (generally  at  80KJ^  to  120K;^)  in 
which  the  basic  control  device  is  a  small' computer . 
The  pioneering  work-in  this  area  was  performed  by 
W.  R.  Busing  at  Oak  Ridge  and  his  experience  has 
been  that  a  computer  with  a  4Kword  core  can  perform 
the  ^ask. 

in  such  of^erations  the  small  computer  is  not  be- 
ing used  much  'of  the  time,  as  the*  real  time  of  the 
experiment  is  slow.    For  this  reason  there 'has  been 
considerabl-e  effWt  lately  to  have  the  small  computer 
perform  reductionXsteps  on  the  previous  data  point 
while  handling  control  of  the  collection  of  the  pre- 
sent data  point.    To  do  this  in  general  one  requires 
more  tha' ^Kwords  ofVore,  but  discs  are  generally 
inexpensive  addition^  to  staall  computer^*.*  However, 
by  the  time  one  buys  k  small  computer,  'a  ,disc,  and 
perhaps  an  industry-compatible  magnetic  tape  output 
device,  one  has  invested  about  ^OKJ^  In  the  "small"  ' 
computer.  '  Perhaps  this  is  why  the  maAuf acturers 
of  small  computers  do  so  well. 

♦ 

Where  do  we  go  from  here  ' 

*The  advancement  that  has  been  obviously  needed' 
^  for  several  years  now  is  a  -direct  link  between  the 

•experiment  and  the  large  computer.  ^Tfte  small  control  ' 
,  cojnputer  is  simply  inadequate  to  perform  all  the  de- 
sired tasks,  and  if -it  is  adorned  with  myriad  acces--.  ^ 
sories  tbe  .calculations,  even*  if  f easib-le*,"  still  are 
prdhibitlvely  expensive.    If  a  .,link  between  the 
experiment  and  the  big  computer  existed  then  pne 
..^gpiald  not  only  checks  a  current  measurement-  against 
one  made  several  thousand  operations  befbre,  but  one 


data  acquisition.    Once  the  overall  structure  is 
known  it  is  possible  to  decide  ^whlch  data  points  are-  ' 
most'  critical  for -the  accurate  determination  "of  a        ^  , 
structural  feature' of  chemical  interest;*    in  this  ' 
interactive  mode  tlcie  big  computer  could  direct  the 
course  of  data  collection.        ^  '  V 

tpes  this  sound  impossible?    No,  X%  does  not  to 
thb^e  of  us  who  have  grown  up  with  computers.    To  me 
it    sounds  far  more  likely-  than  did  the  notion  20  years  . 
ago  that  the  calculations  I  w^s  then  performing  irv  4o  " 
hard  hours, of  my  time  on  ah  IBM.  604  electronic  card 
punch  would  some  day  ( the  present)  be  done  in  less  than 
a  minute,  and  at  a  far  lower  price,  - 


could  anticipate  a  routine 
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A  MODIL' STUDY  OF  KETONES  WITH  ROTATING  GROUPS  . 
S.  MacKensie,  S.  F.  M«r«occi»  and  Mary  I.  Scanlan 
Pastore  Chemical  Laboratory  add  The  Coaputtr  . 
'  Laboratory,  Uniyeraity  or  Rhode  Island 


In  phenylacatone  (I),  there  la*  electronic  inter- 
action between  the  carbonyl  group  and  the  bensene' 
ring  un4er  the  ipf luenoe  of  light  *  The  ultraviolet  / 
BfectruB  therefore  ahewa  a  grelit  increase  of  absorp- 
tion (fig.  1).  In  benzylacetone  (II),  the  probabil- 
ity of.  effective  coojperation  is  Jiuch  reduced  so  that 


.CH5-G-CH, 

2  y  3 


II 


each  grottp  shows  its  normal  uv  absorption. 


Figure  1^ 

Ultraviolet 
.  Spectra 
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Sevejral  reasons  for  this  electronic  interaction* 
have  been  proposed  in  the  literature' .  Cited  most 
widely  are  the  ideas  of  'Cookson^  and  Labhart  and 
Wagniere3.  In  the  former,  it  was  thoyght  necessary  t  . 
for  the  plane  of  the  carbonyl  group  to  face  ,the  plane 
of  the  benzene  ring  as  in  figure  2a.  In  the  latter, 
participation'  of  the  n-electrons  on  oxygen  wa^  sug- 
gested. This  idea  (in  our  hands)  led  to  strongest 
interaction  when  the  oxygen  atom  wag  nearer  the  ben-» 
zene  ring  as  in  2b. 


1^ 

z  - 

z  Figure  2  , 


Although  many     otrphenylketones  have  been  synthesized, 
the  role  of  geometry^  in  qontrolling  the  ultraviolet 
spectra  has  not  been  firmjLy  elucidated.  After  all,  the 
,angular   difference  between  2a  and  2b  is  not  terribly 
great  and  it  is  usually  true  that  precise  bond  angles 
have  not  been  measured.  Models,  quite  acpurate  for  bond 
distances,  serve  less  well  for  precise -estimation  of 
*  bond  angles.  Moreover,  certain  molecules  are  intrinsi- 
cally difficult  to  represent  with  models  and  elements 
of  uncertainty  arise  even  for  cases  of  ^fixed  geometry". 
•This, is  certainly  true  for  compound  III  (the  geometry 
of  cyclooentane  rings  is  difficult  to  represent  with  *  - 
models), 'for  IV  (the  large  ring  is  not  perfectly  rigid), 
and  perhaos  even  for  V,^lthough  errors  here  are  mu^K 
less  serious.  ^  , 
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III  IV  V  •  ^ 

•     60^,60*  90®,  90^  i20*,  60^ 

295  mfv  (224)^  2d5  •Jt.(709)^^     295  m|i  {27^)^ 

A  further  aource  of  difficulty  is  found  in  the  fact 
that  the  energy  levels  of  bensene  rings  art  somewhat 
perturbed  as  a  function  of  the  degree  df  substitution. 

rt  was  therefore  felt  desirable  to  study  the  geom- 
etry of  c(«-phenylket ones,  wherein  only  mbnosubstituted 
benxene  rings  were  present .These  can  be  prepared  in 
larg0  number  and  should  not  involvi  any  shift  in  the 
normal  energy  levels  for  the  aromatic  system.  The  sole 
difference  ir  this  family  of  ketbhes  lies  in>he  siib- 
stituents  present  on  the  o{-»carbon  which  influence  the 
positions  taken  by  the  ketone  and  ar<Hnatic  groups. 


Ph       .  /h 

^Ac:. 

29d  af^(230)    2^6mfu(217)    295  m   .(252)  ^^^5  myJi50) 


In  early  computer  programs  which  represented  this 
sltuationi  each  atom  had  three  coordinates  (x#y|Z) 
and, each  of  these  was  expressedi  where  necessary i 
as  a  function  of  the  angles  of  rotation.  A  Votation^ 
was  merely  a  recalculation  at  the  various  t^ngi^lar 
positions ♦  The  phenyl  ring  was  ."turned"  in  15^ 
increments  throughout  IdQo  while  the  ketonfe  group 
was  turned  throughout  36OO  In  the  same  steps.  At 
each  position I  the  energies  of  Interatomic  repulsloh 


J 
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wefe  calculated.  For  e^fample,  **the  repulsion  energy, 
R{H-H)  ,\  between,  two  hy$lrogen  atoms  is  a  function^  of 
of  the  distance,  D,^  between,  them, 

^R{H..H)       IQ^^.e  ,        -49*2/D^  .  . 

Such  equations  are  invalid  at  very  ^infilli  ^ra^ues  of  D 
and,  when  these  arose,  the  program  caused  thi  scrtting 
of  population  value  to  zero  and  the  bypassing  of  fur- 
ther. calculations  at  that  angular  settings  All  the  *' 
repulsion  ener|;ies  were  summedij  entered  into  tK'€r  equ- 
ation of  Arrhenius,  and  the  mole  fraction  term  calcu- 
latedt  At  ths  end  of .  the  rotation  exercise,  the  mole 
> '  fraction  terms  .Were  divided  by.  the  total  -of  sjuch  terms 
'  ^  and^thus  normalized  to  a  sum  of  1,0,  The  matrix  (12x 
25)  of  normalized  mole  fractions  is  called  the*popu- 
^  latibn  pattern  for  the  given  ketone, 

"Figure  3>  page  5>  shows  a  population  pattern.  For 
'  a  molecule  with  one  ketone  group  and  one  phenyl  ring, 
the  pattern  is  a  two-dimensional  matrix  of  numbera, 
(in  ^fig»  3,^*t4ie  numbers  have  been  represented- by  var- 
^    ious'^s^bols) ,  These  numbers  represent  the  percentage 
of  time  that  the  molecule  finds  itself  in  the  angular 
,  position  indicated -by  the  choice  of  abscissa  and  ord- 
inate. After  a^  sul)stantial  number  of  sijch  patterns  was 
pr^^pA red >  there  wts  experimentally  sought  a  single, 
matif'ix  which  could  be  used  to  multiply,  term  by  term,  . 
the  molecultff*  matrices  so       to  obtain  totals  indica- 
tive of  the  Spectral  properties  of  th^  molecules.  This 
'latter  matrix  was  called^ the  inteosity  matrix.  The 
.  best  such  intensity  matrix  found  thus  far  is  a  simple 
•  function  o*f  trigonometric  values,"  " 

KH//^)  -  sin       sin  ^  \ 

Thi^3  intensity  matrix  has  theoretical  relevance*  It  is 
related  to  the  sigma  component  of  orbital  overlap  of 
the  ketone  and  benzene  groups.  Figure  K  shows  the  plot 
: ^  obtained  using  this  function, 

in  our  later  programs,  more  economical  presentation 
^   was  necessary  because  of  the  large  number  of  atoms  and 
\i*otations.  The  program  for  3  f3-diphenyl-2-butanojie,  for 
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Figure  3 
A  Population  Pattern 

V 
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example,  gave  a  population  pattern  (12x12x25)  which 
was  three-dimensional.  Calculation  of  all  of  the 
atomic  coordinates  in  such  a  repetitive  manner  would 
have  involved  a  prohibitive  amount  of  computation 
time.  Hence,  atomic  coordinates  were  replaced  by 
precalculated  repulsion  energies >  presented  as  data 
as  a  function  of  angles  of  rotatii)n. 


'  \  interaction       data  matrix  (size) 


The  program  for  triphenylacetone  required  only  the 
first  two  matrices,  the  selection  of  repulsion  values 
being  taken  through  12x12x12x25  angular  positions. 

The  final  result  of  the  preparation  of  population 
patterns  and  multiplication  by  intensity  matrix  is 
shown  in  figure  4,  page  7,  On  the  plot,  two  lines  are 
shown,  both  of  which  are"  steep  functions  of  the  angles 
of  rotation.  The  upper  line,  on  which  seem  to  fall 
points  for  the  more  highly  alkylated  ketones,  follows 
the  equation  below, 

2  2 

^(C^H^gJ  -  610  sin  V|/  sin  ^    +  26 


The  lower  line,  the  reality  of  which -remains  under 
ex«niination,  cseas  appropriate  for  ketones  less  highly 
alkylated.  Change  of  solvent  to  EtOH  magni^es  the 
difference  between  the  two  lines  because  of-  unequal  - 
solvent  response  of  the  ketones.  Thus,  this  computer 
survey  of  phenylketones  suggested  abandonment  of  the 
idea  of  LabhSrt  and  Wagniere,  a  'later  theory,  and  re- 
af'firmation  of  the  idea  of  Cookson,  the  earlier  theory, 

Olefinic  ketones  also  should  show  exalted  n-TT* 
transitions  and,  indeed,  the  effect  has  been  often 
noted.  The  exaltations,  however,  are  much  lower  than 
for  corresponding  phenylketones.  This  fact,  and  a 
number  of  structural  considerations^  made  the  under- 
taking of  a  rotation  program  unattractive.  Instead, 
there  will  be  presented  here  certain  obser.vationis 
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Plot  of  .Extinction  / 

Coefficient .  (C^Hl2^  ^/ 
Angle  Function 
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For  "JLdentification,  the 
OC-substituents  on  phenyl- 
acetone  are  given. 
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related  to  molecules  ^of  fixed  geometry  gpd  one** theor- 
etical treatment. 

*If  the  results  of  the  phenylketone  program  above 
have  led  in  the  right . direction,  the  maximum  value^ 
of  6  of  /3  ^'J^ -unsaturated  ketones  should  be  found 
when  the  angles  ^  and  $  are  each  90  •  Dreiding  models 
of  complex  molecules  VI  and  VII  suggest  angles  of  . 
90^.  The  values  of  €  are  the  largest  thus  far 'seen. 


VII 


9. 


301  mfK  (220) 


297  m^(234) 


If  the  extinction 'coefficients  of  these  ketones  con- 
tinue to  remain  the  largest  found,  thftp  it  can  be  con- 
cluded that       ,>^-unsaturated  ketQnes(  have  values  of 
G^about  1/4  those  of  corresponding  ,0^-phenylketones. 

2  2 

If  the  sin  g;sin.<9    relationship  is  accurate,  the 
three  ketones  below  should  have  the  same2extin|tion 
coefficients  and  these  should  be  220  sin  60sin  60  or 
124. 


296  mp.  (122) 
120°  60° 


10 


300  m|t  (93  ) 
,  60°  240° 


11 


CH2 

293  nipk(ca  66) 
•  60°  60° 


Thip  pr^dietion  is  seen  to  be  true  only  in  the  first 
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approximation  and.  one  must  ask -the  question  as  to 
whether  or  not  the  relationship  dpes  not  predict 
values  too  large  at  small  values  of  ^  ,  the  angle 
of  rotation  of  the  ketone  group.  Perhaps  the  answer 
is  affirmative,  for  comparison  of  III  and  V  shows 
the  same  effect.  Unfortunately,  the  la.st  compound- 
shown  (600,60°) ,.  very  sensitive  and  easily.  ^eaTranged 
endo,  has  not  yet  been  obtained  completely* homogeneous. • 
The  possible  failure  of  the  relationship  at  such  small 
values  of  {y  may  indeed  bear  ^n  the  previous  problem, 
the  existence  of  tWo  lines  in  figure  4.  Certainly- for 
triphenylacetone  it  can  be  correctly  said  that  the 
oxygen  of  tTie  ketone  -grouii  must  lie  near  at  least 
one  phenyl  ring.  In  dialkylated  phenylacetones,  how- 
ever, population  patterns  showed  that  the  oxygen 
atom  >fas  more  likexy  to  be  tipped  away  from  the  phenyl 
ring.  Hence  the  existence  of  two  lines  in -figure. 4 
may  reflect  such  a  difference.  Just  one  more  ^corapar- 
isoh  is  shown  below. 


Ph  Ph 


.        60°     ^  '    Kffr.  120*^; 


305  m^  (301)  284  m  p.  (468) 

This  comparison  is  far  from  an  ideal  one,  being  macie 
complicated  by  structural  uncertainties  and  srolvent 
differences.  Once  again,  however,  the  ketone  having  * 
the^  greater  angle  \if  has  the  higher  extinction  coe^f- 
ficient.  ^  ^ 

Olefinic  ketones  of  fixed  geometry  have  data,  in 
summary,  which  suggest  the  need  for  minor  revision 
of  the  function  The  modification  may  need 

decreased  values  when  \U  is  less  than  90o. 

The  quantum  mechanical  development' of  a^theory  to 
match  this  circumstance  has  not  yet  been  set  forth. 
Out*  quantum  mechanical  treatment,  which  followed  the 
idea  of  Labhart  and  Wagniere,  is  set  forth  here  for 


the  fir St" time.  It  is  readily  seen  that  predicted 
spectral  values  are  not  in  accord  with  observations. 


Predicted  Spectral  Properties  of 
'       a  /S      -Unsaturated*  Ketone 
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Intensities  for  the  CT  band  are  lO-'f  and  those  for 
the  ketone  band  are  lO^f. 
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*    Data  Acquisition  and  Equipment  Control  in  Microwave 
.    Spectroscopy  Using  a  Dedicated  Digital  Computer 


M»  D.  Harmony,  D,  K*  Hendricksen,  ?•  M.  Fast,  and  D*  L.  Nordlund 
Department  of  Chemistry,  Upiversity^of  Kansas 
Lawrence,  Kansas  660kk 

Abstract.    The  University  of  JCansas  is  developing,  under  a 
National  Science  Foundation  grant,  an.  automatic  data  acquisition 
and  equipment  control  system  for  experiments*  in  microwave  spectto-. 
scopy.    The  heart  of  the  system  is  a  small ^general  purpose  digital 
computer.    Programs  have  been  written  at  the  University  which 
enable  this  .computer  to  operate  a  microyave  spectrometer  automat- 
ically.   Signal  averaging  and  smoothing  techniques  are  used  to  * 
enhance  weak  signals.    The  system  i6  currently  Used  to  study  the 
spectra  from  naturally  occurring  isotPpic  species  of  small  mole-  • 
cules;  proposed  studies  include  structure  and  energetics  of  free- 
radical  sp6cies  with  millisecond  life  times,  generated  by  a  pulsed 
UV  flash  or  electrical  discharge. 

The  research  program  in  microwave  spectroscopy  at  the 
University  of  Kansas  is  varied  anjd  inc lodes  the  usual  molecular 
structure  and  dipole  moment  determinations,  measurements  of 
quadrupole  coupling  constants  and  barriers  to  internal  motions, 
and  also  the  study  of  radicals  an^  other  unstable  spectes.  The 
microwave  spectrometer  is  of  standajrd  designj^  with  a  phase- locked 
Gertsch  system  which  controls  the  klystron  during  computer- 
assisted  operation. 

Under  a  grant  froili  the  National  Science  Foundation,,  two 
Hewlett-Packard  2116B  general-purpose  digital  computers  and  . 
associated  equipmei^t  vcre  purctiased  in  19^9^  one  for  the  microwave 
spectroscopy  laboratory,  the  other  for  the  high  temperature  labo- 
ratory, in  the  Department  of  Chemistry  at  the  University  .of  Kansas. 

This  paper  describes  the  data  acqu,isition  and  equipment 
control  system  in  the  microwave  spectroscopy  laboratory  and  its 
present  and  proposed  uses  in  det6tminlng  the  structure  and  ener- 
getics of  various  molecules.    Presented  elsewhere  in  this  confer- 
ence are  papers  which  report  work  performed  in  the  high  temperature 
laboratory  on  computer  control  of  a  quadrupole  mass  spectrometer 
(1)  and  a  high  temperature  magnetic  mass  spectrometer  (2). 


o 
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Figure  1  is  a  block  diagram  of  the  data  acquisition  and 
equipment  control  system. presently  used  in  the  microwave  spectro- 
scopy laboratory.    At  the  heart  of  the  system  is  a  Hewlett-Packard 
2116B  general-purpose  digital  computer.    This  is  a  small,  but  fast, 
stored-program  computer,  us^d  for  real-time,  on-line  interaction 
with  the  experiments  in  the  laboratory.    It  has  a  memory  cycle 
time  of  1.6  microseconds,  8,192  l6-bit  words  of  core  memory,  and 
15  input/oirtput  channels  in  a  priority  interrupt  structure,  through 
which  all  communication  with  external  devices  is  accomplished •  ' 

Varioi)^  peripheral  devices  are  connected  to  the  computer  to 
enable  it  to  carry  out  the  operations  associated  with  five  aspects 
of  laboratory  automation:  defta  acquisition,  equipment  cofitrol, 
operator  interaction,  inforaation  storage  and  retrieval,  and  data 
processing. 

Three  devices. are  used  for  data  acquisition.    A  high-speed 
analog- to-digital ^converter  preceded  by  a  data .amplifier  is  used 
tp  acquire  the  customary  lock* in  detfector  output  voltage  from  the 
microwave  spectrometer.    A  frequency  counter  is  used  in  determining 
the  microwave  frequency,  and  a  time  base  generator  provides  periodic 
signals  to  the  compute]:  for  various  timing  operations. 

The  principal  control  function  is  provided  by  a  pulsed  stepper 
motor  which  is  driven  by  digital  signals  from  the  computer  and 
tyhich  controls  the  microwave  frequency  scan.  '  \ 

» 

Op'erator  interaction  is  provided  by  a  telet5q)e,  an  oscillo- 
scope, and  a  paper  tape  reader.    The  operator  initiates  an  (experi- 
ment by  loading  the  reqtiired  progr-ams  into  the  computer  memory 
from  magnetic  tape  or  paper  tape  and  providing  initialization 
parameters  from  the  teletype.    He  may  later  request  a  chatjge  in 
procedure  or  enter  new  parameters  by  setting  appropriate  console 
switches.    Parameters  may  be  entered  through  the  paper  tape  reader 
or  teletype.  "  -  * 

Data  are  presented  to  the  operator  either  in  printed  form 
on  the  teletype  or  in  graphic  form  o^  the  oscilloscope  or  strip 
chart  recorde^ 

The  system  has  three  methods  for  storing  information  in  a 
retrievable  fashion:  directly  in  computer  memory,  on  the  magnetic 
tape,  or  via  dataphone  to  another  computer  system.    This  may  be 
either  th^  other  HP  2116b  located  in  the  high  temperature  chem- 
istry laboratory  or  the  central  Hone3rwell  655  comJ)uter. 
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Data  processing  includes  error,  checking,  converting,  scaling, 
smoothing,,  averaging,'  and'  other  data  reducing  operations.  This 
processing  c^n  be  performed  during  an  experiment  ("real-time")  or 
subsequently  from  stored  data;  it  can  be  performed  on  the  Ideal 
computer  ("on-line")  or  on  a  remote  compyter.    Timing,  considerations 
and  memory  limitations  require  that  Very  little  processing  of  data 
be  done  in  the  real-time,  on-^line  mode.    Xack  of  floating-point 
hardware  .and  high-speed  peripheral  devices  on  the  local  computer 
requires  that  any  large-scale,  processing  of  experlntental  results 
be  done  ou  the  remote  central  computer.  '  ^      *  - 

Input/output  operations  are  carried  out  via  a  hardware      ,  ,  ' 
priority  interrupt  system,  which  allows  computer  processing  and 
various  input/output  operations  to  be  performed  simultaneously. 
Thus,   the  computer  can  overlap  acquisition,  control,  display, 
storage,  and  processing  to  obtain  maximum  speed  and  efficiency. 

Three  general  types  of  computer  programs  are  required  for 
these  operations:  system,  communications  and  applications.  System 
programs  translate,  modify  and  load  other  programs,  provide  util- 
ity functions,  and  perform  hardware  tests.    Communications  pro- 
grams provide  transfer  of  information  between  the  computer  and 
its  peripheral  deyices,  and  applications  programs  are  those  which 
direct  the  computer  during  an  experiment.  '  h 

One  of  our  first  goals  for  the  comput'erized  operation  has 
been  weak  signal  enhancement.    We  now  have  the  hardwarersof tware 
configuration  which  permits  spectral  searches  to  be  made  over 
narrow-band  frequency  regions  entirely  under  control  of  the  data 
acquisition  and  equipment  control  system. 

Currently,  signal  averaging  is  performed' in  a  stabilized 
mode  using  a  phase- locked  Gertsch  system,  which  permits  ease  and 
accuracy  in  microwave  frequency  variation.    Figure  II  is  a  block 
diagram  of  the  spectromeCer-computer  system.    With  the  stepping 
motor,  the  microwave  frequency  is  varied  stepwise  through  a  rafige 
of  5-10  MHz.    At  each  step,  a  selectible  number  of  voltage  readings 
at  selectible  time  intervals  are  taken.    The  average  of  these 
readings  and  the  reading  of  the  counter  monitoring  the  Gertsch 
FM-6  are  stored  in  memory.    Each  sweep  through  the  spectral  region 
subsequent  to  the/first  is  displayed  on  the  oscilloscope  simulta- 
neously with  the  current  average  of  all  sweeps.     If  an  individual 
run  is  unacceptable  for  any  reason,   the  operator  may  reject  that 
sweep  during  or  after  the  run. 
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The  software  used  to  accomplish  signal  averaging  gives  the 
operator  flexibility  and  control  before,  during,  and  after  a  run. 
Initialization  parameters  include  the  frequency  step  size,  the 
number  of  readings  at  each  point,  the  time  interval  between 
readings,  and  the  counter  time  base.    During  a  run,  the  program 
checks  the  position  of  switches, on  the  computer,  which  allows 
control  during  execution  of  the  program. 

With  this  procedure,  we  are  getting  two  types  of  averaging.^/ 
The  first  of  these  is  short  tem  and  occurs  at  each  step  where 
we  take  repeated  voltage . readings,;  and  store  only  the  average. 
The  eliminates  relatively  high  frequency  noise.    The  second  method 
is  long  term  averaging  which  occurs  from  repetitive  scans  over 
the  entire  frequency  band.    This  eliminates  low  frequency  noise.. 
We  also  use  a  third  technique,  digital  smoothing^,  for  increasing 
signal-to-noise  ratio.    This  is  a  leastrsquares  proce*cfure  which 
must  be  used  with  some  caution* because •  it  wil-1  smooth  anything  • 
present.    Thus  a  noise  spike  may  appear  to  be  a  good*  absorption 
line.  *  '      '  /^ 

Figure  III  illustrates'^  the 'effects  of  these  types  of  ^yer>aging.; 
Part  (d)  is  one  sweep  thrpugh  a  line,  one  reading  per  point.  Part 
(c)  is  1  sweep  through  the  line,  ten  ridings  per  pdipt..    Part  (  b) 
is  k  sweeps  through  the  lin^^  ten  readings  pei*  j>oint.    P^rt  (  ci)  is 
l|-sweeps  through  the  line,  ten  readings' per  |jpiiit,  smoothed.   .All  . 
other  parameters  were,  kept  constant.^   This  absorption  peak  is  the- 
ground  state  J  =         J  «  .2;  transition  for  carbonyl  sulfide    •  , 
QI8Q12534  in' natural  abundance.    The  reisult  fn.  (  a)-  could'-be  im*^  ; 
proved  further  with  increased  averaging' but  it  si&TV,es,  to  illustrate 
the  p^r-ocedureS'.  *       ,    ^  ,         ^      ^         '      '  * 

The  usefulness,  of  .these  Signal  enhancement  teohniqqes  (witTi 
sensltiyit.y.  g^ins  of -•$.-10)  is  clear*.'*  , They  permit  the  observation 
of  weak  absorptipn  signals  tSfhich  have  b^^tr  unde,tec'^b),e  previously. 
*  The.  method  fih'd^-  it's  greatest  use  %n  locating  X^.nes  \s(|'  ibw-ab'un.fl:-  . 
ance'  iscttopic  species  in ;'c;KGir;;.  natural  abundances. 


..^    The  f  ir^t  ^oiecuie  we  are  studytrig^^in 'this  way  is  cytloprdpyl- 
amin^^    We  are  searching^  for.  this  Q^^^  isot<3pie-$pecies  in  order  to 
determine  .,a.  more  accurate  ring  sttucture.  v    _  , 

;  Our  developments  up,  to.the  p'resent  have  been  relatively  simple 
and  sti^'j^ight'-forwar^^.  We  belieX^e  .that  \^ith  the"  basic,  system  now 
op6^rating,  "it  *is  possible  tO'  deyise  n^v  experiments  not  previously 
f  eas-ible.  '  1      ,  _  .        -  . 


V 
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Pigure  III 
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*  • 

4  • 
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FOr  example,  with*appropriate  software  developments  it  should 
be  possible  to  carry  out  search,  and  analysis  routines  for  certain 
spectral  features  in  a-  real-  or  nearly  real-time  sense.  This 
procedure  might  be  carried  out  as  follows •    Assume  we  were  interested 
in  locating  triplets  having  1:5-5  relative  intensities •    A  sweep 
would  be  initiated  a!nd  would  be  permitted  to  continue  until  any 
absorption  feature  were  detected •    An  absorption  feature  would  be 
defined  ^as'any  signal  having  a  signal-to-noise  (S/N)  ratio  greater 
than  some  predetermined  value •    This  signal  would  be  saved  and  the 
sweep  mode  terminated  tempx)rarily •    The  computer-stored  signal 
would  then  be  processed. to  determine  the  number  of  maxima  and  their 
relative  intensities.    Note  that  some  type  of  frequency  width 
specification  for  the  triplet  spacing  would  need  to  be  made  also* 
If  the  observed  feature  conformed  within  predetermined  limits  to 
the  desired  1:50  multiplet  ,it  would  be  stored  for  further  analysis; 
otherwise  it  would  be  discarded  and  the /sweep  mode  resumed.  Storage 
of  good  data  could  be'  accomplished  on  the  local  magnetic  tape  or 
by  transfer  via  the  dataphone  to  the  GE-655  system.    The  latter 
mode  would  be  most  appropriate  if  large  sc^le  processing  were  needed, 
pr  if  a  nearly  real-time  analysis  were  desired. 

Another  major  aim  is  further  development  of  the  data  acquisition 
and  equipment  control  system  to  make  possible  detection  of  the  micro- 
wave^ 'spectra  of  unstable  molecules  (or  free  radicals)..    The  goal 
here  is  to  study  the  -s^tructure  and  energetics  of  species  having 
lifetimes  as^short  as  1  milliseconjl,  and  to  elucidate  theflt  kinetic 
behavior.  * 

The  method  to  be  developed  involves  the  production  of  high 
radical  concentrations  by  a  pulsed  UV  flash  or  electrical  dis- 
charge, followed  by  rapid  measurements  (at  rates  of  up  to  ca.  5^ 
KHzX  of  signal  intensity  as  a  function  of  frequency  for  short 
periods  of  time  (l  millisecond  to  1  second  typically).    The  entire 
experiment  will  be  carried  out  under  control  of  the  computer.  AH 
of  the  signal  enhancement  methods  mentioned  earlier  will  be  utilized. 

Other  experiments  which  utilize  the  r^pid  measurement  technique 
will  be  investigated  also.     These  include  a  broad  class  known  as 
relaxation  phenotnena.    For  example,  the  time  dependence  of  absorption 
signals  dttring  the  on-off  square  wave  modulation  period  (modulation 
frequency  is  IOO-5OOO  Hz)  provides  information  about  the  transfer 
of  rotational  energy  betwe|JL^rotationaIly  excited  moiecules  and 
other  (non-excited)  molecu^s.  * 

In  these  rapi(/-measurement  'experiments  the  principle  design  \ 
problems  lie  in  tV>e  timing  requirement's  for  the  discharge  or  flash 
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pulse  and  the  subsequent  measuring  cycle;  and  also  in  the  handling 
of  the  large  voliime  of  data  which  will  be  generated •    Our  system 
has  the  capabilities  for  data  inputs  up  to  a  rate  approaching  50 
KHz  ( limited  by  A-D  converter)  but,  of  course^  the  limited  core 
memory  prohibits  such  a  rate  for  long  periods  of  time/    The  pro' 
cedure  will  be  to  take  data  in  short  bursts  and  to  dump  onto 
magnetic  tape  files  for  processing  off-line*  at  a  later  time. 
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I.  INTRODUfTION 

We  will. try,  irt  this- brief  paper,  to  present  those  high- 
lights of 'our  two  and  one-Kalf  years  experience  in  using  an 
on-line  computer,  time-shared  among  several  users,-  which  .we 
hope  will  be  useful  to  other  chemists  wlio  have  set-up  or  are 
planning  to  set  up  similar  systems.    First  we  will  describe 
the^ types  of  uses  in  which  We*re  involved,  -to  orient  you  as 
chemists.    Second  we* 11  describe  our  IBM  1800  based  system, 
its  hardware  and  software,  of  course,  but  more  important  the* 
philosophy  and  approach  which  have  made  this  an  extraordin- 
arily successful  venture.    Last  we'll  describe  the  sbrt  of  ^ 
ideal  system  we'd  shoot  for  if  we  were  making  our  plans  today, 
instead  of  in,  1968 • 

•   II.    CHEMICAL  USES 

As  in  any  chemistry  department,  we  have  a  range  of  faculty 
interest  in  computers,  from  those  who  find  them  indfispensible 
to  those  who  find  them  u^e.less.    Four  faculty  members  who  felt 
their  research  strongly  demanded  on-line  capability,  both  to 
run  their  experiments  and  to  allow  "hands-on*'  types  of  modelling 
and  data  analysis,  joined  together  tQ  write  the  proposals  for  ^ 
the  grants  which  support  the,  system^  first  to  NSF  for  a  Pepart- 
mental  Equipment  Hi^ant,  and  then  to  NIH  "f9r^ operating  and  ex- 
pansion funds.    Sevetal  types  of  experiments  now  run  or  will 
run  sijnultaneously  on  the  system  (see  Pig.  1). 

Jim  Arnold  runs  a  multiparameter  particle  detection  system 
for  cosmochelnical  studies. 

^\lrray  Goodman,  who  has  just  joined  the  department,  plans 
to  interface  two  Cary  spectrometers,  oae  for  circular  dichroism 
measurements,  through  a  minicomputer  to  the  IBM  1800'to  facili-' 
tate  his  studies  of  the;  stereochemisf^ry  cff  polyrpcrs,  biopolymers, 
and  their  model  compounds. 

Joe  Kraut  and  Nguyen  Xuohg  use  the  system  for  X-ray  crystal - 
lographic  determinations  of  the  structure  and  function  of  pro- 
teins.   Thd  X-r^  diffraction  data  is  gathered  in  two  ways,  first 
through  a  diffractometer  interfaced  to  a  dedicated  PDP-8,*which 
in  turn  is  interfaced  to  t^e  l^K  1800>.  and  second  through  a 
camera  and  an  automatic  scannirig  film  digitizer    which  is  dir- 
ectly interfaced  to  the  1800. 
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Bob  Void  is  finishing  the  hardware  to  int^rfac'e  his  pulsed 
NMP  spectrometer  wHich  he  is  using  to  study  liqyid  crystals j 
protein-substrate  interactions,  and  protein  denaturation. 

Kent  Wilson  runs  his  photofragment  spectrometer^  through  . 
the  1800  and  collects  and  analyzes  several  million  bits  of  in- 
formation per  day  in  studies  of  dissociative  molecular  states 
and  of  air  pollution:    In  addition,  he  uses  the  computer  system 
for  on-line  generation  of  computer  animated'  films. 

Nguyen  Xuong  is  usilfig  the  directly  interfaced  film  scanner 
for  human  chromosome  analysis,  as  well  as  for  X-ray  diffraction 
studies.        '  '     '  « 

In  addition,  the  above  research  groups  and  many  others  in 
the  Chemistiy  Department  use  the  system  extensively  for  inter- 
active* data  an^lys-is,  model  testing,  and  Qours^e' grading . 

III.  'PRESENT  SYSTEM 

"A.  Hardware 

The  system  we  use,  based  on  .an  IBM  1800,  is  shown  in  Fig.  1 
At  present  we  have  32K  of  2  usee  core,  30  million  bytes  of 
medium  speeij  disk^  magnetic  tape,  two  typewriters,  card  reader 
and  punch,  line  printer,  incremental  plotter,  several  storage 
CRT^s,  a  mouse,  various  high  apd  -low  speed  digital  inputs  and 
outputs,  DAC's,  ADC's,  and  interfaces  to  film  scanner,  TV 
camera  and  experiments,.    The  purchase  cost  of  the  system  would 
be  approximately  a  ftiird  of  a  millions-dollars,  but  .by  buying 
various  subsections  as  they  became  available  from  other  compu- 
ters (using- purchase  credits  built  up /from^  rentals  paid  on 
other  U.C.  computers),  by  ttaditig  interfacing  for  equipment 
with  manufacturers,  and  by  purchasing  used  equipment,  we've 
assembled  the  system  for. considerably  less. 

^    '  .  »    B.    §i  ftware 

The  system  runs  under  MPX,  a  multiprogramming  executive. 
Various  users  have  different  partitions  in  coi^e.    Most  data 
collection  and  all  d^ta -manipulation  and  storage  is  by  Fortran 
routines  brought  in  from  disk.    Core  swapping  requires  less 
than  a  second,  in  most  cases,  and  this  is  sufficiently  fast  for 
Virtually  all  applications.    With  swapping  we  can  reserve  a 
large  background  partition  for  intermediate  or  final  data  pro- 
cessing. •  .       '  ^ 

C.    Philosophy  ^ 

Our  general  reason  for  setting  up  the  systewf  ts  that  each 
of  us  can  afford  more  powerful  and  convenient  computer  usage 
by  banding  togethx?r  ill  a  time-shared  system  than  we  could  if  wc 
each  tried  to  set  up  our  own  separate  computer.    We  feel  that 
demand  for  usage  should  flow  from  below,  from  the  potential 
users*    Vou  can  lead  a  potential  user  to  a  computer,  but  you 
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can't  make  him  use  it,    A  sizable  fraction  of  any  group  of 
chemists  will  find  on-line  computers  useful,  but  it's  on-ly 
asking  for  frustratio;i -to  try  tQ  bring  ever>^one  into  it. 
Therefore,  although  we  make  the  computer  available  to  all,  we 
find  that  only  a  third  of  the' research  groups  in  the  depart- 
ment use  it. 

We  .would  rather  have  an  old-fashione'd  system  which  works 
than  a  futuristic  one  which  doesn't.     (This  doesn't  mean  we'd 
buy  an  1800  system  today       s^e  the  next  section.)^  Within 
six  weeks  of  delivery  we  w^re  running  data  programs  and  within 
three -months  we  were  time-sharing  three  experiments.  Sve*ve 
learned,  particularly  in  software,  to  buy  realities,  not 
.promises. 

The  economics  of  a  system  needs  to  be  tuned  for  most  ef- 
ficient usage.    Anyone  can  write  a  program  which  will  tie  up 
a  computer  almost  forever.^  The  hourly  charges  to  users  need 
to  be  high  enough  so  that  everyone  thinks  about  efficiency, 
but  not  high  enough  so  that  the  machine  stands  idle.  Our 
present  rates  are  $6  per  hour.    The  costs  of  interfacing  new 
on-line  experiments  are  largely  borne  by  the  individual  re- 
search groups,  'again  to  encourage  each  group  to  think  care- 
fully about  the -financial  realities  of  their  usage. 

Staffing  is  probably  the  most  important  decision  to  be 
made.    One  great  software  or  hardware  man  is  worth  a  half- 
-dozen  mediocre  ones^.    A  jsystem  this  size  shouldn't  be  run 
by  a  staff  operator,  but  should  6e  open  24  hours  a  day  for 
operation  by  the  users  themselves.    They  then  form  a  group* 
which  quickly  educates  newcomers.    Our  system  rarely  crashes, 
even  with  dozens  of  users. 

We've  made  two  serious  errors,  which  still  are  with  us. 
The  first  is  in  having  only  one  permanent  staff  member.  The 
load  is  just  too  great;  he  can't  get  away  for  vacations  because 
so  many  groups  depend  on  kim.    The  second  mistake,  and  one  to 
which  we  see  no^good  answer,  is  t^at  we  must  write  prant  pro-  - 
posals  several  years  in  advance.    The  computer  .equipment  we'll 
buy  three  years  from  now  is  probably  only  a  dream  in  a  designer 
head,    thus  it's  very  difficult  to  justify  equipment  money  in 
advance,  an4  w.e  are  chronically  short  of 'it. 

'      .     .  IV.     IDEAL  SYSTEM 

■*  • 

The  ideal  system  we'd  shoot  for,  if  wt  were  starting  in 
1971,  is  jShown  in  Fig.  2.    There  is  nothing  original  in  this 
concept,  but  we'd  like  ta  add  our  weight  in  what  we  feel  is  the 
rij^ht  direction*  '  ♦  . 
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Two  things  have  changed  since  1968.    First,  minicompu- 
ters have  dropped  drastically  in  price,  and  it's  clear  that 
henceforth  almost  all  new-  laboratory  instruments. will  come 
with  interface,  if  not  mini,  attached.    Second,  computer  cen-  . 
ters  are  becoming  more  willing,  in  part  because  nfewer  com- 
puters make  it  easier,  to  allow  outside  computers  to  tie  in. 
Therefore,  we'd  try  tp  set  up  a  double  or  triple  layer  system, 
with  many  minis  in  the  laboratories,  .perhaps  an  intermediate 
processorj.n  the  middle,  and  finally* a . large  computer  for 
number  crunching  and  massive  storage  at  top. 

The  laboratory  end  would  utilize  very  small  mini  compu- 
ters; with  ADC's,  DAC|s,  pulse  counters,  and  other  gadgets 
for  digitizing  data  and  controlling  experiments.    The  mini 
would  buffer  data  between  the  experiment  and,,  the  main  compu- 
ter.   The  terminal  devices  could  be  anything  with  a  keyboard 
and  some  fcrm  of  output.    Programming  of  the  main  computer 
could  be  done  through  the  laboratory  terminal  but  more  usually 
would  be  by  remote  terminals  outside  of  the  laboratory  area. 
The  mini  computer  would  interpret  instructions  sent  by  the 
main  computer  to  obtain  sequence  of  operations,  number  of  iter- 
ations, output  values  to  the  instrument,  efc. 

The  high  speed  laboratory  communications  port  could  be  a 
small  or  medium  computer  with  its  own  disk  storage  and  a  very 
limited  computational  capability.    Thq  primary  purpose,  of  this 
computer  would  be  to  generate  short,  fast  jobs  and  act  as  ' 
scheduler  and  message  concentrator  for  the  main  computer.  " 

The  main  computer  would  b^  a  central  data  processing 
facility  for  a  campus  or  large  laboratory.    It  would  probably 
have  multiprogramming  capability  with  a  smalL  partition  as- 
signed to  the  laboratory  computer 'system .    A  high  level  lan- 
guage would  be  interfaced  to  the  laboratory  communications 
•port.    Facilities  for  generating  interpreter  strings  for  the 
mini  would  be  available.         '  . 

The  sequence  of  operations  would  be  something  like  this: 

i)    The  researcher  sits  down  at  a  terminal  and  makes  con- 
nection with  the  regular  time-sharing  system  running 
from  the  central  computer.    He  proceeds  to  write  a  program 
to  treat  his  data.    Atthe  point  in  the  program  where  data 
is  required,  he  performs  an  output  operation  referencing 
a  string  of  interpreter  instructions.    This  output  oper- 
ation is  followed  by  an  input  operation  referencing 
the  variables  he  expects  to  get.    Subsequent  input  oper- 
•ations  are  controlled  by  the  last  interpreter  string  out- 
put.   Wlnen  he  completes  the  program  he  ^aves  it  in  the  * 
permanent  library  of  the  main  computer.  ' 

ii)    U^hen  the  experimenter  is  ready  to  run,  the  experi- 
menter calls  up  the  program  he  has  written  in  step  i). 
The  interpreter  string  is  transferred  to  the  mini.  l\1ien 
the  read  operation  occurs,  the  communications  procpssor 
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causes  workspace  to  be  saved  and  the  mini  commences 
execution  of  the  interpreter  string.    The  data  is 
collected  by.  t^e  mini  and  sent  to  the  communications, 
processor.    The  data  and  control  information  is  saved 
on  the  communications  processor's  .disk  in  a  queue  of 
jobs  to  be  run  by  the  main  computer.    ivTien  the  com- 
munications unit  gets  to  that  position  in  the  queue 
it  calls  for  the  saved  workspace  and  passes  the  data  to 
it.    The  information  is  then  operated  on  and  any  feedback 
required  is  sent  to  the  mini.    If  n-ecessary,  a  new  in- 
terpreter string  is  sent  to  the  min^i  to  change  the 
character  of  the  experiment.    If  data  is  to  be  saved  on 
a  file,  the  main  computer  does  so.    The  workspace  is 
then  freed  for  another  job  until  more  data  is  available. 

iii)    After  all  the  data  is  collected,  the  researcher  goes 
back  to  his  office  and  sits  down  at  a  terminal.    He  may 
•now  perform  any  data  analysis  he  wishes.  Meanwhile, 
someone  else  can  be  using  the  laboratory  instrument, 
with  the  same  or  different  programs  in  the  mini.  If 
more -immediate  data  analysis  is  required,  it  can^be  per- 
formed simultaneously  with  the  running  of  the  experi- 
ment, and'displayed  in  the  laboratory. 

The  beauty  of  this  scheme  is  the  systems  responsibility. 
There  is  one  central  responsible  agency;  at  a  college  or 
university  probably  the  campus  computer  center.  Problems 
such  as  data  management  and  general  analysis- programs  need 
to  be  solved  only  once.    Laboratory  programs  would  be  port- 
able from  laboratory  to  laboratory.    The  additional  invest- 
ment by  any  one  laboratory,  .once  the  system  was  implemented, 
would  be  under  $10,000,  yet  direct  access  would  be  provided  to 
the  power  of  a  large  computer. 

No  systems  programming  need  be  done  by  the  researcher. 
A1:  the  very  wor^t  he  might  hire  a  student  "programmer  to  do 
some  of  the  more  annoying  work.    In  addition,  a  single  labor- 
atory mini  computer  might  initially  serve  several,  laboratories . 
The  computing  center  would  hire  one  or  two  interface  consul - 
*tants  and  they  could  provide  valuable  direction,  if  not  actual 
hardware,* to  the  researcher  at  the  outset. 

Clearly  one  needs  a  computer  center  that  is  sensitive  to 
the  needs  of  the  chemist.    With  the  present  financial  state  of 
most  centers,  perhaps  they  will  be  more  willing  to  tap  a  rapidly 
expanding  need  than  they  have  been  in  the  past. 

The  initial  investment  reauired  to  set  up  a  system  serving 
all  the  laboratory  sciences  depends  on  the  f^^ci'lities  available 
at  the  specific  computer  center.    If  there  is  already  a  time- 
-sharing  system,  it  should  be  less  than  ^2S0,000.    This  cost  is 
very  reasonable,  and  could  be  recouped,  if  necessary,  by  user 
charges. 
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D.  W.  Schlosser,  and      R.  Wilson,  Rev.  Sci.  Instr.  41, 
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'  FIG.  1.    The  existing  laboratory  automation  system  consists 
of  an  IBM  1801,  four  IBM  2311  disk  files,  a 
^  Kennedy  MTS-10/1800  magnetic  tape,  a  Holly  line 

>  printer,  three  Tektronix  611  storage  displays,  an 

IBM  1442  card  reader-punch,  and  two  IBM' 1816 
selectric  typewriters.    The  PDP-8  has  an  ASR  35 
as  10. 

FIG.  2.    Our  "ideal*'  laboratory  automation  system  consists 
6f  a  time-shared  certtral  computer,  for  example 
a  PDP-10,  about. 100  megabytes  of  disk  storage  for 
remote  use  and  a  smaller  drum  storage  for  system  * 
use.    The  laboratory  communications  computer  would 
be  a  PDP-11,  IBM  1130  or  1800,  or  any  computer  of 
that  general  size.    Disk  $torage  would  probably  be 
about  1  megabyte.    The  lab  computers  would  be 
eight  or  16  bit  mini  computers  with  about  4K  or 
•less  storage. 
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Application  of  the  Monte  C&rlo  Method  to  First70rder ' 
Rate  Equations  .     '  ^  \      \  l'^,,.  [  ;  -  ■ ... 

John  J,  Manock  and  David  L.  Hooper,  ChBmis^try./Departm^r^tv 
'Western',  Carolina  University,  Cull6wh3e,'  N.  Q^-  .2$723 


There  have  been--  several' articles  wriitefr  ^hich' kave'  * 
illustrated  the  application;pf  th^  Monte* Carlo  method  to 
the  area  of  chemical,  kinetics  ( 1 , 2i. ...This  .me^hsdd  ife  . . ' 
applicable  to  any  system  which  ca.n'b.e  treated  statis-  " 
tically  as  a  sequence  ,of  'purely  random  ev-ents  anci  is 
particularly  useful  for  systems.,  which  ^  dp -'not  y^^ld-.  ari,.;*  _ 
integrated  soluti'O.n .  from  the.  closed'  analyt4"dal /foVifn  or  . 
where  the  physical  problem  ±.s  loss  in  th$  complexity^; 
of  the  mathematical' solution;  (^3  ,4)-..  It  al3b  eliminatfe-?; 
the  necessity  for  a  steady-state  approxinfeticn  Which 
cin  result  in  erroneous  re3ults;|'(3  ,5)-.       "  ' 

A  general  computer ,  program ,  written  ;in. 'Fortran  JV 
'L.evel  'F;  has  been  developed  ^s^hich  -uses "the  Morite"  Carlo 
method  t.o' simulate  the  course  for  any  first-order  >eac^' 
tion:  reversible , 'consecutive ,  and/or  parallel.    This  .: 
program  is  ideal  for  student  a-se  with  either  a. -terminal 
or  an  on-liae  printer.     The  student,  is  r^quir^ed  to 
supply  only  relative  rate  constants  and^  initial  jlonc^tn- 
trations)  and  .he  can  instantly  observe  reversible con-  - 
secutive,  and' parallel  reactions. or  any  combination. of 
these  ♦     The  flow  ^hart  for  the  progra'm  is.  showrf  in  ^'fjigure 
1,    As  in  the  caje  of  the  following  generai*  rea'ction' 

o/etc.    '  ^ 


the  non-zero  rate  cor^stants,  R(L,M),  and  the  Initial  \ 
mole  fradtion  concentrations  of  A/'B,  C,  etc,  are  .pro-r*  " 
vided  by  the  invest.igatoi;/    A  matrix  is  developed  -with 
number*- 1  for  molecule  A,  number  2^  f of"  vmol6cule  B.,  etc. 
These-  numbers  are  placed  in  the  matrix  ^.n  the  ratio  of';'^ 
the  mole  f ractions.^,,    The  largest  common' iiivider-  among  "' 
the  rate  constants  is  determined  ^nd  is  designated  as  - 
TN  in  Figure  1,     This  value  is  chosen  as  the  incremental" 
test  factor.    Each  specific  non-zero  rate  constant  i^.. 
then  tested' in  succession  relative  to  the  variable,  SN", 
and  if  a  specific  rate  constant  is  larger  tha'h 'or- equal 
to  this  variable,  that  molecule  can  react*.    Rando'iijly  a" 
molecule  is  chosen  f rom  , the. storage  .matrix  and  ^tested  " 
to  see  if  it  is  the'  molecule  desired.     If  it  is  the  • 
desired  reactant,  a  fruitful  event  has  occured  and  this 
molecule  is  replaced  by  the  corresponding  product.  [    '  -  ^ 


\ 
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Figure  1.-  Flow  chart  'for  the  Monte  Carlo 
FrograjnT 


If  the  randomly  *  chosen  mplsGule  *ls  not  the^  malecule  de- 
sired', no  friijLtfiCL^  eVent  .occurs  a-nd  the. next  rate  .con--, 
stant  is  considered.---~'''Wh.enr  air*ri6nr-^ero  .'rate  •  constants 
have  beenr  tested,  SN-  i^i  incremented  by  TN , and' the'' pro- 
cedure "i^  fepea'^ed/    When  SN  becomes  l,arge-r  tha.n  the 
maximunf  rat'e  coh'stant,  an  element  of  time'  i*n  noted',  SN 
is  initialised  to 'TN^^  and-tl^e  process  is  repeated.  At 
given  increment  s  of  t^ime^  .jrelatiy-e  ^  cqncent  rat-Ions  of  A, 

,B,  "C,  etc."  are  .tabulated.     Th^  reaction  is  con1:inued' 
until  95^',completi'on  has- "occured  or  until  equilibrium,  is 
established.  ''At  this  point  "the  concentration^^  o-f  the 
components  as /functions  or  time:  are  plotted  on  an  on- 
line printer.   'The' program ,  including  th"e,  plot  subrou- 

'tine,  requires,  $P;K  a?\d'.  takes;  approximately  five^minutes 
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to  execute- on  an  IBM  36O,  Model  30,*  xomputer.  '  By  re- 
moving the  plot  subroutine  and  reducing  the  si.ze  of  the 
storage  matrix,  *  thj^s  program  can  be  readily  adaptei^  .to  , 
smaller  computers-.       •  ^       .  '  - 

To  illustrate  this.p-r.ogram  with  a  specific  example  j 
consider  the  following  consecutive  reaction 


where  the  initial  mole  fraction  of  A  is  1.0,  H(l,2)  =^1.0 
and  R(2,3)  =  2.0.     The  common  divider,  TN,,is'l.O.     SN  , 
is  initially  set  equal  to  this  value.     In  the  first  cycle 
the- rate  constants  R(l,2)  and  R(2,3)^are  tested- and  the 
value  for  SN  is  increased  to  2.0.     For  the  second  cycle 
only  R(2^3)  is  tested,  thus,  requiring  that  the  molecules 
A  and  B  be  considered  in!  the  ratio  of  one  to  two.  Upon 
testing  R(l,2),  if  the  randomly-  chosen'tnolecule  is  1  . 
then  a  fruitful  event  has  occured  and  the  storage  area  , 
is  replaced  with  the  number  2,  and  likewise  when, testing 
the  rate  equation  R(2,3),  if  the  randomly  chosen  jnolecule 
is  the  number  2  then  the  storage .area  for  this  molecule 
is  replaced  with  the  number  3'.    This  process  AS*co,ntinued 
until  the  reaction  shows  completion.   .As  shown  in  Figure 
2,  results  from  this  program  agre^  within  1^  of  the  in- 
tegrated values.  This  precision  can  be  increased  by  ex- 
panding the  relative  sample  size,  however,  the  time  re- 
quired for  execution  increase  proportionally.  : 

This  program  provides  a  method  for  readily  evaluatijig- 
proposed  mechanisms  of  first-order  and  pseudo-first-order 
reactions  by  comparing  empirical  data  wj.th  'calculated 
values.    A  proposed  mechanism  can  often- be  accepted  or 
rejected  by  visual  inspection.     In  addition,  this  pro- 
gram can  be  used  as  a  method  for  finding 'relative  rate 
constants  for  a  complex  system  by  comparing  experimental 
values  with  calculated  values  arid  changing  the  relative 
rate  constants  by  a  corrective  iteration  process.  "The 
program  is  currently  being  applied  to  problems  of  this 
type  in  photochemical  reactions. 

The  authors  would  like  to  express  their  appreciation  • 
^ to  the  WCU  Computer  Center  "for  assistance  with  this 
project  and  the"  North  Carolina  Academy  Q-f  Science  f-or^ 
financial  support,  .  . 
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Figure  2.     Coipparison*  of  integrated  vaXues 
I—)  and  ,resiilts  obtained  from  the  Monte 
Carlo  method  ^(  o  ) .  • 
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Series  Integration  of  Consecutive  Rate  Equations 
Howard  J.  Butcher"^*  and  Norman  C.  Peterson^^ 


ABSTRACT 


The  integrated  rate  laws  for  two  complex  reaction  schemes  are 
developed  in  terms. of  converging  series  allowing •calculation  to  any 
desired  accuracy.  The  rate  laws  So  not  require  tabulated  functions 
for  computa^fion.    The  schemes  are: 

k    '  ■ 

B  +  C   ^ 

k„     .  ■       ■      •  . 

D   ^    E       ■  • 

and 


Examples  of  intermedij^te^^ncentration  [D]  vs.  time  curves  evaluated  using 
a  computer  are  illu^rated  graphically. 


Taken  in^  part  from  the  Thesis  of  H..J.  Butcher, '  submitted  in  partial 
fulfillment  of  the  requirements  for  the  degree  of  Doctor  of  Philosophy 
in  Chemistry  to  the  Polytechnic  Institute  of  Brooklyn,  1966.    Woi^  sup- 
ported- in  part  by  a  National  Institutes  of  Health  Predoctoral  Fellowship*  ~ 
to  H.J.B.  and  in  part  by  PTThS.  Research  Grant  No. "AM  10946-03  .from ,  the  \ 
National  Institute  of  Arthritic  and- Metabolic  Diseases.- 

St.  Mary's  College  of  Maryland, .  St.  Mary's  City, "Maryland  20686 

^Polytechnic  Institute  of  Brooklyn,  Brooklyn,  New  York  11201 
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Solutions  to  the  differential  equations  arising  fxom  complex, 
coupled  rate  equations  have  been  regarded  as  generally  too  diffi::ult 
for  exact  solutions.     In  the  era  'before  computers  were  widely  avail- 
able much  experimentaj.  information  was  lost  by  ,  resorting  to  steady 
state  or  other  approximate  solutions  in  treating  experimental  data. 
A^ong  the  ipany  areas  stimulated  by  the  increased  availability  of  high 
speed  electronic  digital  computers,  is  the  non-steady-state  intergra- 
tion  of  rate  Equations  by  numerical  methods.     Fairly  complex  kinetic 
schemes  can  now  be  solved  by  numerical  integration  algorithms.,  such  as 
^the  fourth  order  Runge-Kutta  procedure  (4)  or  predictor-corrector 
algorithms (5), 

Although  these  are  generally  useful  techniques,  questions  of 
accuracy  and  computation  time  can  arise.     In  particular,  numerical  solu- 
tions such  as  these  can  be  in  error  or  even  unstable  if  the  integ- 
ration step  size  and  round  off  errors  are  not  carefully  optimized,  and 
these  numerical  teqhniques'  must  be  carefully  checked  against  accurate 
solutions  to  insure  confidence  in  the  results.     It  is  necessary  to  have 
exact  solutions  available  in  analytical  form  to  check  the  accuracy  of 
the  numerical  methods.     Iiltegr^ted  rate  equations  for  many  of  the 
simpler,  rate  equations  of  kinetics  are  known' in  terms  of  elementary  ^ 
functions.     Solutions  for  several  mechanistic  schemes  have  been  ob- 
tained in  terras  of  tabulated  functions.   (6-9)*but  often  such  solutions 
are  not  readily  translatable  into  a  computer  program. 

In  this  p^per  we  advocate  the  use  af  solutions  -in  terms  of  infinite 
series,  which  are  readily  programmed  for  raichine  computation.  Because 
they  are  exact  solutions  they  are^not  subject  to*  numerical  instab- 
ilities.    Series  solutions  are  therefore  more  accurate  than  solutions 
obtained  by  numerical  integration,     finally,  series  solutions  are 
generally  more  directly  obtainable  than  solutions-^n  terms  of  special 
functions.    Few  examples  are  found  in  the  literature  of  integrated  ^ 
rate  laws  of  any  type.     It  is  hoped  that  examples  of  this  procedure 
•.will  help  to  reacquaint  chemical  kifteticists  with  some  classical 
tools  for  solving  differencial  equations.     The  examples  presented 
he^re  are  rapidly  Convergent  to  high  accuracy.     These  were  developed  for 
use  in  a  non-linear  estimation  program  for  estimating  rate  constants 
^nd  testing  mechanisms,  a  procedure  which  would  hfeve  been  impractical 
without  the  accuracy  and  speed  of  a  digital  computer. 

A  series  solution  arising  from  the  irreversible,  mixed  second  order 
first  order  sequence  is  given  below.     It  is  shown  how  to  compute  the 
concentration  of  species  to  any  predetermined  accuracy. 

The  mechanism  is: 


The  rate  equations  are: ' 


-dB         -dC  • 


dD 
dt 


k^BC-k^D  -  (2) 


'  ■  dE 

dT    =  r 

Taking  Bo  end  Co  as  the  initial  values  pf  B  and  C,  the  instantaneous 
concentrations  of  the  reactants,  equation  (1)  is  easily  integrated  to 
give  the  well-known  result: 

in    igSl    -    k,t       '  (4) 


Bo-Co  1 
After  making  the  substitution: 

^  G    =    Bo  -  Co 


and  noting  the  stoichiometric  requirement  that  B-C  =  Bo-Co,  solving 
for  B  and  C  gi\>es: 


B  = 


Bo-Co  exp[-Gk^t] 
and 

.      C   .=    B(Co/B6)  exp^(-Gk^t). 


The  time"  rate  of  change  of  the  intermediate  concentration,  D,  is  given 
by   equation  (2)  or 

•  2 

^         *  A  (Co/Bo)  exp  (-Gk^t) 

"dt  [1- (Co/Bo)    feJcp  (-Gk.t)]^  •  (2a) 


Now  let'Z  =  (Co/Bo)  ^xp  (-Gk^t)  and  let  ^  ^\  .    The  differential 

equatipn  (2a).  can  then  be  transformed  to:^     •  '  ^ 


\ 


This  equation  has  the  integrating  factor  z'^. 


or 

G  dZ 


Z'*(l-Z)2 


'Formally  integrating  between  limits  D=0,  Z=Co/Bo  =Zo  at  t=0'  gives- 

fz        dZ  ■  .  6  ■ 

D    =    -GZ^  JZo 


zM(i-z)2 

Define  the  function  R  (Z)  by 

R(z)   =  J^^ 


z*^(i-z)'   •    ;  . 

To  .aid  in  evaluating  the  integral,  it  is  useful  to  apply  the  binomial 
expansion 

(i-z)      =  S  (j+i)zJ  -    ,  .  ■ 

j=o 

2  '  . 

This  series  is  convergent  for  2    <  1. 

Now  it  can  be  assumed  without  loss  of  generality  that  Bp>^o  ^nd 
G  >0  because  of  the  symmetry  of  the  equations  with  respect  to  interchange  ' 
of  B  and  C.    Thus,  it  is  evident  that  r<  1  for  all  t.    The  series  is 
therefore  uniformly  convergent.    After  being  expanded,  R(Z)  becomes 


=  J  E  (j+i)zJ"*^dz 


R(z) 

j  =0 


This  series  is  readily  integrated  term  by  teEfins,,jj^  Thus  (10)  if  ^  5^  j , 

00 

R(z)    =       L  jZ^ 

J    "  1  A  


and  '  *  . 

D    =    GZ*^  [R(Zo)  -  R(Z)]  .  ^  (5) 

The  concentration  of  product  Ey  Is  obtained  by  difference  viV>. 
E  =  C^-C-D+Eo.   ,To  examine  the  convergence  of  R(Z),  the  jjifinite  series 
can  be  approximated  by  a  finite  partial  sum  with  a  pre-assigned  accuracy. 
The  truncation  error,  the  difference  between  th>a  n-tb  partial  sum  and  the 
infinite  sum,  can  be  assigned  .an  upper  limit'(ll)  .by  considering  the'discrete 
summation  variable  as  a  continuous  one,  so  thatj 

•   i3a  .•     >  ■ 
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I     jZ^di  I    _    I  r»  z^di  I 


where  ^    is- the  magnitude  of  the  truncation  error  iu  R  (Z)/Z. 
•  n 

The  denominator  can  be  replaced  by  (1-^/n)  preserving  the  inequality 
since  ji/n  is  a  small  positive  number  less  than  1  for  j  sufficiently  large, 
i»e*     n>j.     Thus  an  upper  limit  On  the  truncation  error  is  given  by: 


Iroo 


Z-'dj  ^      Z"  (n>  ^) 

(l-)i/n)l  (1-^/n)  jlnZ| 


For  example,  when  Z=0.5,  ^^=0. 90909          the  series  was  evaluated  until  the 

nih  term  was  less  than  10.     of  the  n  tfh  parta^al  sum.    This  required  22  terms 
and  led  to  a.  truncation  error  as  calculated  above,  of  ^22*^^^"         where,  the 
sum  itself  is  6.29286.    As  the  time  increases  Z  becomes  smaller,  and  fewer 
terms  are  needed  to  obtain  an  accurate  value  of  the  sum. 

For  integer  ij  the  integral  can  be  expressed  exactly  in  closed  form  (12>. 

R  (Z)  =   ^  ,  — 


Let 


When  ^  =  '  *^ 

R^(Z).^1/(1-Z)  -  In  [(1-Z)/Z]  (6)  ' 

FotjU>l,  the^  recursion  formula  (7)  can  be  used  to  reduce  the'^^'integral  (9) 
to  one  in  'terms  of  R^(Z),  l>e,  equation  (6),  .     -  . 

-1        ,  .  m  '  ' 

R  (Z)  =   z  ^+    R    ,(Z)  0) 

:       ^  ^  (m-l)z"^"^(l-Z>  m-l      "^"^  '  /  ' 

The  integrated  rate  equation  has  been  programed  as  a  Fortran  subroutine 
for  use  with  a  program  for  nonlinear  estimation  by  non-linear  least 
sq4iares  curve-fitting. "  (13)  .    .  , 

When  the  -ratio  if  iftitiapl  concentrations  Co/Bo  is  small- the  solution 
♦  for  D  should  reduce  to  the  same  form  as  that  for  the  intermedtate  in  the 
irreversible  fir^t  order-first  order  sequence.     This  is  .evident  as  follows: 
suppose  Co/Bo  '«1,  then  G      Bo  and  j^x     K^/CBok^).    Only  the  first  term 
of  the  infinite  series  is  needed  because  of  the  rapid  Cpnvergence.  After 
substituting  in  equation  (5)  and  simplifying, 'one  obtaitjs:^  . 

k^BoCo        ^  .     _       '     \  ^ 

D  =    [exp  (-k^t)      exp '(-Bok  t)  ],'• 

.  Bok^-k2 

which  is^  the  required  limiting  form. 

Calculated  curves  of  the  intermediate  concenjtration  vs.   time  are 
shown  in  Fig.  1  JLn  which  the  rate  constant  k^^  is  varied  near  1.0.  The 
curve  C  corresponds  to  the  qase*  ia  which  the  closed  form  solution  applies. 

1S4 .  ,  '  . 
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' ,  The  second  scheme  is  the  irreversible,  mixed  second  order,  second 
order  sequence.    A  solution  to  the  differential  equations  of  this  scheme 
in  terms  of  Legendre  functions  was  obtained  by  Kelen  (?)•    However,  the 
series  solution  presented  here  can  be  obtained  exactly  without  need  o£ 
tabulated  function,  and  is  readily  programmed  for  digital  computation. 

The  solution  of  the  differential  equations  is  similar  in  some  re- 
spects to  the* one  developed  above.  Some  of  the  equations  are  repeated 
for  clarity.    The  mechanism  is: 


^1 

B  +  C  i  D  (8) 
•k 

•     2D  tE  ■  .  .  .  (9) 


The  rate  equations  are: 


^     =    ky/2  '  (12)  ■ 

at  z  '  *  , 

*  ^ 

Taking  Bo  and  Co  as  the  initial  valqes  of  B  and  C,  the.  instantaneous 
concentrations  of  the  reactants,  equation  (10)  is  first    integrated  to 
give  the  previous  result:  ^  . 

Again  making  the  substitution:  •  • 

.    ^  =  Bo  -Co  .  '  ^ 


,\    \and  no'tring^the  stoichiometric 'requirement  that  B-C  =  Bo-Co,  solving  for 
B  4nd.  C  gives  as  before.;.  .  '  \  * 


B- 


GBg- 


"Bo-Co  .exp  [-Gkj^t.}.' 


•and 


C    =5    B(Co/Bo)  exp  (-Gk^t). 


»'  •  / 
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The  time  rate  of  change  of  the  intermediate  concentration,  D,  is  given 
by  equation  (11)  or 

.         k.G^ (Co/Bo)  exp  (-Gk't) 
dt  2.  [l-(Co/Bo)    exp  (-Gk^t)]2  qi^) 

Now  let.Z-=  (Co/Bo)  exp  (-Gk.t)  and  let  7)  =  ^2/1^^^'.  The  differential  eq-uation 
(11a)  can  then  be  transformea  to: 

2 

dD     2jD_    _    -G    _  , 


dZ  ■  6Z      ~    (1-Z)2  ^  Q2) 

This  is  a  Riccatti  equation,  and  may  be  transformed  to  a  second  order 
IjLnear  equation  by  the  change  of  variable 

-GZ  u' 

°    =     ^    u  (13) 


resulting  in: 

Z(l-Z)2    •^.+     (1-Z)2    ^    .    77U  =  0         ■  ■  (14) 


dZ 

This  equation  can  be  solved  directly  by  the  method  of  Frobenius.  First 
assume  a  solution  of  the  form  •  "  • 

■    •    u    =    z''    I'a.  Z^-  (15)  • 

.  f  * 

•V. 

Substituting    (15)  into  the  differential  equation  and  equating  the  co-  ^ 
efficient  of  the  first  term  to  zero  results  'in  tj^e  indicial  equation 
r    =  0/   One  solution  can  then  be  found  from  the  series  expansion 

'   u      =  Ta.        '  '   ^       '  .  .      •  (15a) 

*  1        .      J      ♦  ' 

Substituting  the  expression  (15a),Lnto  the  dif fei:ential  equation  (14)'^, 
an(J  setting  the  coefficients  of  Z*'  equal,  to  zero  gives  ^  the  following 
relations  among  the  a^:      •  *  '  . 

^    =    a^rj.  \;  ^  -(16)-  . 

Vl    ^    L(2j^  +  T])  a.  -  (j-l)^a._^]/(j+l)^     ,  (17) 

Th^  coefficient  a  t  can  be  chosen  arbitrarily  ar^d  othe^rs  can  be  obtained 
using  the  recursion  fonnulas  (16)  and  (17). 

•  A  second  order"  dif  ferential  equation  has  two  linea-j;ly  independent  - 
solutions,  and  the  second  solution  has  the  form, 

u'    =    u.    -InZ    +    Lb.  .    '  "(18) 

j=0  .       ■  .    •  . 


By  substituting  equation  (18). for  u  in  the  differential  equation  and 
equating  coefficients  of       to.zerp,  it  is  found  that  b    is  arbitrary  and 
the  other  coefficients  are  related  by  the  recursion  formulas: 

b^    =   rjb^    -    2a^  '  (19) 

|ljj+l=i:*2Q+l)a._^^4Aja.-2(j-l)a^_^+(2j  ^  r?)b.-Cj-l)^  b  ._^]/y+l)2     (20)  ( 
The  concentration  o£  D  is  obtained  from  equation  (13): 

■•   ■■■■  ^T^^€= 

"  -r^  j-ri;;^   ;  <i3-'- 

wher^  X  is  a  constant  of  integration,  apd  the  terms  in  the  numerator^ 
are: 

'^"1  =  la.jzj  /  (21) 


and 


7n 


u. 


dZ        dt  ^  "i 


The.  integration  constant  \  can  be  obtained  from  the  initial  condition, 
say  at  t=0,  D=0  and  Z=^-Co/Bo«    The  condition  that  the  rfti^^^ltor  in 
(13a)  must  be  zero  at  t=0  gives:  ^^f^-  ^.  i 


^  Z=*Zo 

^  dZ  ) 

?du2] 

[  dZ  / 

Z=Zo 

The  Concentration  of  product  E,  is  obtained  by  difference,  viz. 
E  =  Eo  +  (Co-C-D)/2. 

The  differential  equation  (14)  has  regular  .singularities  at  Z=0 
and  Z=I..    It  can  be  ptove9(14;  that  an  expansion  about  Z=0  has  a  region 
of  convergence  0<2<1.     Now  it  can  be  assumed  without  loss  of  generality 
that  B6<Co  and  G<0  because  of  the  symmetry  of  the  e^ations  with  respect 
to  interchange  of-  B  and'C.     Thus^  it  is  evident  that  Z<1  for  ail  t»  The 
series  is  therefore  uniformly  cot1[vergent« 

K  Sample  cal\:ulations  were  performed  in  which  the  infinite  series 
were  truncated  when  the  last  term  for  u' 2         less  than  c,  and  it  was 
found  that  the  number,  of  terms  required    decreased  with  increasing  values 

of  t.     iPoT  example,  when  Co  =  1.0,  Bo  was  2.0,        =  ^2  =  1*0  and  e  =  10  '  , 

28  terms  were  required,  at  ,t  =  0.1  and^lS  terms  at  t  =  0.6.     The  concen- 

.  tratians.^f  intermediate  computed  were.  0^17271636  and  0.53396965  respectively. 

^  '  •  -6  .  ^ 

Repeating  the  calculations  with  e  =  10      requited  19  ^Hd  13  terms 

with  , cone entratibns  0.17/271640  and  0.  53396835  .respectively.  , 

The  integrated  rate  equation  has  b€»en  programed  in  Fortran 
and  copies  of.  t|Y^  dompiiter  program  can  .be  obtained  from  the  authors. 

fcalculat;&d  curves/ of  the  Intermediate  concentration  vs.  time  are 
shown  in  Figs*  2  and  3  in  which  Bo  and  the  rate  constant  k2  are  varied. 

/  \  ,  ^  ^ 
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Figure  Captions 

Concentration  of  intermediate,  D,  vs.  time,  for  the  mixed 
second  order- first-order  mechanism* Co  =  1,  Bo  =  2,       -  1- 
A:  kj^  -  0.8;  B:kj^  =  0.9;  C:kj^  f  1;  D-.k^^  =  1.25. 

Concentration  of  intermediate  vs.  time  for  the  mixed  second 
order-  second  orders  sequence-  Co.  =  1,  k.  =       =  1  A;  Bo  =  2.0; 
B:  *Bo  =  3.0;  C:  Bo -=  1.75;  D:  Bo  1.25. 

Concentration  of  intermediate  vs.  time  as  in  Fig.  2  but  with 
Bo  =  2.0;  k'   =  1^0;A:  k    =  1;  X:  'k    =  8.0;  Y:  k,  =4.0; 
Z:.  k-^  =  2.0;  W:  k2-=,0.;5. 
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Computers'  In  Chemical  Education 
^  by  Joseph  -Lagowski 

In  the  past  several  year^there  has  been  an  accel- 
eration of  interest  in  the  use  of  computer-based  techniques 
in^^ithe  educational  process  in  chemistry.     Initial  emphasis 
has  been  in  remedial  and/or  drill  interactions,  and  re- 
cently there  has  been  an  interest  in  mathematical  model- 
ing and  similation  techniques  .     In  fact,  there  are, now 
available  examples  of  computer  methods  to  supplement  vir- 
tually..  every  phase  6f  the"*  classical  instructional  process  I 
Pres^lited  in  this  symposium  are  papers  illustrating  many 
of these  points  .in.  detail,  as  well  as  papers  dealing  with 
the  broader  aspects  of  using  computers  such  a5  languages, 
strategy,  result s.> 

The  call  for  papers  for. this  symposium  yielded  more 
than  three-.times  the  number  6f  papers  than  the  organizing 
committee  ^anticipated.    Accordingly,  some  rather  arbitrary^ 
decisions  had  to  be  made  even  aftef^  an  ^extra  session  was 
accomodated' into  an  already  crowded  schedule.     From,  this 
experience  it  is  apparent  that  the  paper?  presented  here 
ref3jact  only  a  part  of  the  present  activity,  in  the  field. 


The  Development  of  Computer'^  Based 

Instructional  Systems  in  Chemistry 
S^.  J.  Castleberry,  Dept''.  of  Chemistry 

 The  University  of  Texas,    Austdn,  Texas;  78712   

Computer  applications  in  chemistry  have  usually  -taken 
one  of  two  courses:^    1)  batch  oriented  applications  such 
as  large  number  crunching  jobs  and  teaching  programming  ^ 
for  the  solution    of  chen\^ical  problems,  of,  2)  CAI  pro- 
gramming of  tutorial/remedial  drill  jnaterial*     In  the 
fiX'St .  Application  the  most  widely  used  languages  are 
.FORTRAN  and  BASIC;  the  letter  application  usually  involves, 
a  very  specialized  and  highly  hardware  dependent  lang- 
uage and  frequently,  .a  dedicated  hardware/software  system.. 
Examples  of  CAI  languages  are: 

1.  The  Coursewriter  languages 

2.  TUTOR  ^      ^  ' 

3.  . Scbolar-Teach 

4.  PICLS 

5.  ^  CLIC      -  -  ' 


6.  PLtI 

7.  CATALYST/ PlL 


Some  of  the  better  known  dedicated  systems  are: 

1.  IBM  1500/1800  Instructional  System- 

2.  PLATO- 

3.  H/P  2000B 

The  two  types  of  applications,  batch  oriented  and  CAI,v 
are  not  mutually  exclusive.     Indeed/  a  flexible  "approach  . 
to  applying  computer  techniques  -to-^chemical  edycation  (and 
general'  education)  requires  that/you  devise  a  strategy, 
without  reg-ard  to  hardware  or  language  ,^  which  involves  all 
c£  the  educationally  valuable  applications  of  computers, 
ye  would  suggest  that  such  a  flexible  strategy  should  in- 
clude the  following  characteristics: 

1.  /  the  application  will  supplement  the. existing 

6ourse,  not  replace  it. 

2.  the  applications  5  particularly  .CAI ^will  be 
designed  to/help  students  leara,  no't  necessar- 
ily teach.  ^      '  , 

3.  the  application  will  help  individualize  stu- 
den1j  experiences. 

When  computer  techniques  are  presented  to  students 
as  supplemental  material  they  tend  to  view  the  comt^uter  ^ 
as  .an  aid  and  a  chance  'for  extra  help  rather  than  another  ^ 
dehumanizing  barrier  between  the  studen't  and  instructor. 
/\s  a  supplement,  the  computer  can  remove^the  burden  of 
stereotyped  activities  from  the  teacher  and  allow  him  more 
time  for  the  activities  which  he  perf orms^ best .  / 

Th^re  are  two  major  aspects  to  the  educational  pi/o- 
cess:  the  teaclier^  teaching  and  the  student  learning.  \A11 


i 

too  frequently  teachers  become  Overly  involved  in  hefping 
students  learn  rather  than  teaching;    That  is  they  have 
the  burden  of  assigning,  grading  and  giving  students  feed- 
back on  homework  and' tests;  helping  students  with  their  ^ 
assignments ;.  and  conducting  tutorial/remedial  drill ^group* 
interactions.     To  .a  large  extent  the  computer  can  perform 
these  tasks  {on  an  individual  basis)  as  well  as  or  better* 
than  the  instructor,  for  the  computer  can"  be  programmed 
to  be  the  world^  most  patient  tutor  and  has  the  capability 
of  interacting  with  the  student-  as  often  and  for  as  lorig 
as  the  student  requires.     This,  of  course,  does  not  dimin- 
ish the  teachers  contact  with  'students  but  rather,  makes 
it  possible  for  the  teacher-studen^  interaction  to  be 
richer,  in  the  activities  which  teachers  perform  bes1^: 
giving  insights  into  difficult  concepts,  transmitting  an 
understanding 'of  abstract  ideas,  inspiring  st^adents ,  artd 
obtaining  behavioral  oferjectives  in  the  aff^ective  domain. 

Computers  can  be  used  to  individualize  student  ex- 
periences in  several-  ways.     CAI  programs  ^ can  be  used  to 
measure  entering  skills  and  prescribe  a  series  of  programs  ' 
which  contain  remedial  materials  if  nedessary,'  standard' 
curriculum  materials,  and/or  advanced  placement  materials. 
In  addition,  each  program  can  branch  students  ahead,  pro-f 
vide  extra  work  or  help,  or  branch  back'^on  the-basi«  of 
the  students ^apptitude.     CAI  programs  ckn  allo.w  for  indi- 
vidual differences  in  learning  speeds  bj^^  allowing  stu- 
deiats  to  take  a  module  as*  many  times  as/necessary  and 
woTk  for  as  long  as  desired  or  necess^y.     When  the  stu- 
dentfe  .interest  dictates,  modules  which  .supply  •  specialized 
'  or  enriched  materials  can  be  supplie<J%; 

C^I  programs  can  generally  be  classified  as  tutorial 
drill,  simulation,  or  evaluation.  Witir-cpnPputer  baaed-ihter- 
active  programs  it  is  possible  to  provide -the  student  with 
tutorial  drill  materials  (practice  problem  sets,,  question 
and  answer ^ sessions ,  problem  situations,  etc)  which  are 
tailored  to  his  needs  ^and  are  , unique  to-  hiin.  Simulation 
^progi?ams  can  be  used  to  extend  ^  student  *s  laboratory 
experiences-, to  much  greater  depih  than  -ever  before  . possible  , 
that*is,  the  student  can.  ^perform  a  greater  variety  of  ex- 
periments and  mo^e  of  feach  variety  and  each  time  collect 
his  own  set  of  unique  data^     The  time*  compression/expan- 
sion  capability  of  the  computer  allows  the^  student  to 
perform  e:^periments  which  in  the  real- world  occur  on^a 
^Ve^ry  "  short  time  base  or  a  very  long  time  Uase  (ie.  kirietic 
studies^.     In  addition,  comp'yter  simulat'iODS  allow  students/ 
to  perfo;rm  experiments  which  are  too.  dange;rous  fov  begin-  / 
ning  students  to, perform  on  a  large  scale  in  the  real  / 
laboratpry  and  to  perform  experiijierlts  which  are  too  sophis- 
ticated arid  require  too  expensive  an  apparatus  for  wi(je 
soale  use  by  beginning  studohts*  /-  ' 

•  Imbedded  within  ^the  set  of  CAI  programs  c^n  be  a  series 
of  minimum  level  examinations  which*  tell  the  student  ^^nd'-^- 
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instructor  ti^at         Student  has  Tnastered  a  given  concept 
.or  task  and  is  ready  to  mpve  on.     On  the'ba*sis  of  these 
minimum  level  exams  the  student  can- be/ moved  ahead,  given 
additional  tutorial  aid/  or  the  instructor  can  be  alerted 
that  the  student  is  experiencing  unusual  difficulty  or 
not  taking  the  exams;  lin  either  of  the  latter  casee  the 
instructor  is  provided  with  the  timely  information  that 
he  should  intervene  at  this  critic^  point  in  the  stildent's 
, progress*     The  students  can  take  t^ese  exams  as  ihey  feel 
prepi^red  and  if"  net:essary  they  can  repeat  them  several 
times,/   This  is  m^de  possible  by  the  computers  ability  .to 
randomly- generate/  approximately  e^quiyalent  questions*  and 
parameters , thereby  making  each  e^am  for  each  student  unique. 

*The  compute^  allows  us  to  "/capture"  the  thoughts  and 
strategies  of  our  most  capable  people  and  make  them  avail- 
able to  large  numbers  of  sti;den/bs.     It  is  then  possible 
for  the  pcogramfe"  to  incorporate^ rich  socratic  dialogs, 
discovery  techniques,  and  response  contingent  feedback 
and  branching. 

•Once  the  gfeneral  strategy  of  computer  applications 
has-been  set  comes  the  task  of"  adapting  the.  avciilable 
hardware  and  languages  to  fit  the  strategy.     For  smaller 
institutions  with  moderate  numbers  of  students  to  serve, 
'  there  is  a  widp  variety  of 'm^ni-computers  and  languages 
which  can  be  a'dapted  to  most  educational  applications  • 
A  few  of  these  are : 

r.     PDP    TSS-8  /  BA^IC 
•    .    i:    H/P  2000B  /  BASIC   ^    ■  ' 
.    3.  .  WANG'  3300  /  BASIC 
U.     ?DP  -  11  /  BASIC 

5.  -    Super,  NO^A '  /  BASIC  i 

6.  .  PDP      12  /  FOCAL 

For  large . institutions  which  require  vast  amounts  of 
compute    pq'wer  ther  is  an  even  wider  variety  of  time  sharing' 
systems  and  languages.     Some  of  thes^  are: 

1.     IBM     360/-         ,370/-  series  - 

.    ,  a)     APL  • 
.b)  "  .FORl^N 
'c)  • Coursewriter  III 
d)  BASIC 


ERLC 


2. 


CDC^  ,600'0  'series 
.a)  FQRTMN- 
b)  '  BASIC 
CLIC 


c)  , 

G.E 
a)  . 
b) 


635  (or  MARK  LI  r.s.-> 
BASIC 


FORTRAN 

PD^  -  10  V 

a)  BASIC 

b)  FORTRAN 

•>  • 

others 

.a )  F.OKTRAN 

b)  Basic 


t  ■    ■  ■ 

,    The  question  of  what  language  olo  languages  to  use 
always  arises.     On  the  mini  computer  the  answer  is  usually 
BASIC*     We'.have  found  BASIC  to  be  an  excellent  all  around^ 
.language  for  both  batch  number  crninching  arid  CAI  pragram-. 
ming.'.'  Ojie  oari  do  with  BASIC  almost  everything  you  cah  do 
with  a  specialized  CAI  language  and  BASIC  is  a  veTy  easy ^ 
-language  to  learti.     APL  is  a  powerful,  relatively  easy 
ta  learn  language  which  is  as  adaptable  to  CAI  as  BASJC^ 
but-  is  not,  as  widely  -available  on  different  hardware 
system  as  is  BASIC.  '  The  other  computational  languages 
such  as  FOkTRAN  are  also  ameanab]^e  to' CAI  programming    •  ^ 
but  are  more  difficult  to  learn  and  ^^ecjuire  personnel 
with- a  high  level  of  language  expertise. 

Specialized  CAI  languages 'are  usually  e5Sy  for' 
authors  to.  use  but  tend  to  suffer  from  a  lack  of  ,compu- 
-tational  capability  and  extreme  machine  depend^e  thereby 
decreasing^  their  value  as  ini:act  transf  errable  curriculu^n 
materials .  ^  ^ 

kt'the  University  of  Texas,  the  computation  center 
has  ^developed  ^an  author  language  called  C'LIC,   This  lang- 
uage has  the  easy  to  use  operation  codes  of  a  course  writer 
type  language  and  the  cortpute    power  of  FORTRAN.  The 
language  is. in  fact  written  in  .FORTRS^N  IV  so  that  curricu- 
lum materials  may  be  more  easily  transferred  to  other  sites 
with  standard  FORTRAN  IV  conjpilers.     When  coding  in  CLIC 
it  is  possible  tp  us^  only  the ^op  codes,  use  only* FORTRAN 
or  intermix  op  code  and  fortran  coding. 

We  are  currejitly  using  the  CLIC  language  on  the 
computation  center  CDC  6600/6400  system  and  the  PDP-IO  ^  ' 
system  to  develope  a  fully  computer  supi5]^mented  introduc- 
tory chemistry  course.  'The  number  of  formal  lectures  per 
week  will  drop  from'^S  hours  to  1  hour.     The  student  will 
use  tKe^se  two  ho^urs  per  week  for  individual  and  small 
group  interactions  with  the  teacher,  tape  Recorded  tutorials 
and  discussions  of  tests,  filmstri-p?,  integrated  f*ilmstrip 
and  tape  recorded  tutorials,  CAI  ^^programs .     The  laboratory 
j>7ill  consist  of  a  series*  of  integrated'  real  and  computer 
simulated  experiments  which  will  attempt  to  optimize  the 
student  ^  s  -experience' with  real  lab  situations  and  the] 
basic  skills  of  laboratory  work  and  with  analyzing  anti 
solving  e'xperimental  problems  illustrating  basic  ch§;inical 
concepts .  ^  '  '  ^  *  ^       *  " 

Computer  administered  and  scored  tests  wil*!  be  used 
to  'obtai+i  a  beginning  skills,  profile  of  each  student:.  On 
•the  basis  of  these  *re'sults  the  computer  will  generate  a 
suggest^  prescribed  course  of  individual  oomput^'^;|^nd 
audio*-visual  materials ,  suggesting  advance  placement  or 
remedial  work  where  indicate<i.     The  'suggested  course  must 
of  course  be  approved  by  th6  instructor.    -Once  the  student 
begins  the  course,  minimum.  l^eVel  exams  will  be  .administered 
^.via  the  computer  as  the  student  progress.     Accurate  records 


are  kept  which  the  instructor  has  access  to  and  can  retrive 
at  anytime -and  on* the  basis  of  these ' records  the  computer 
will  send  timely  warnings  when  the  student  is  not  progressing 
(on  the  basis  of  criteria  set  by  the  instructor)  as  he  should. 

Although  the  approach  we^ghavQ  discussed  may  sound 
appealing,  many  will  feel  tKat  cost?  may  Ire  prohibitive 
or  that  the  logistics  of  getting  started  may  be  overwhelm-  ' 
ing.     if  yoitoare  in  a  large  institution  that  already  has 
large  computational  power  and/or  a  time  sharihg  system  the 
major  capitol  outlay  of  money  is  already  over  the  hill  and 
its  simply  a  jn^tter  of  convincing  people  of  what  direction 
to  take. 

If  one  has  little  or  no  avetllable  compute  power^  mini- 
cjDmputers  offer?  ah  opportunity  to*  begin  ah  effort  on  a  small 
scale  at  a  minimum  investme;lt,  25000  to  75000  dollars  (ac- 
tually'as  low  as  1800  dollars  for  a 'jFour  teletype,  hardware 
system).     The  . important  first, step  is  to  devise  objectives 
and  set  the  strategy, or  developing  applications.  Then 
with  objectives.  an,d  stratigies  in  .mind,  inventory  the  avail- 
able hardware    oo  cajiipus  and  monies  available  for  support; 
school  ^in  existance,  possible  grants  available  via  a  new 
proposal..    Personnel  to  help  with  the  project  are  often  one 
of  the  easier  aspect's  of  getting  started.    Usually,  some 
graduate  assistants  can  be  made  available  part  time,  com- 
putation center&\are  often  interested  iu  new  applications 
and  wjLll^make  available  some  manpower.     Some  of  the  devel- 
opmental problemsVpf  a  n^w  system. can  provide  thesis  and 
dissertation 'problems  for  candidates  in  Computer  Science 
and  Science  Education.     Tl;*ere  is  also*  the'  possibility  of 
acquiring  undergraduate  assistance  either  through  the  work- 
study  -program  or  volunteers « 

Computers  hold  great  promise  for  enriching  and  adding 
flexibility  to  the  eduCatTonal  process,  indeed  much  of  that 
promise  has  already  been  demonstrated.    What  is  required 
now  is  the  eyo3.ution  from  demonstration  projects  to  large  » 
scale  application.- 

A 
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The  following  is  a  Ibrief  discription  of  the  CLIC  Sys- 
tem as  written  by  Mis^ ■ Sally -Hartman  of  the  University  of/ 
Texas  Computation  Center.    .    ~  ' 
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'  .  Tfif  GLIC  system'  , 

;^  The  purpose  of  the  system;  CLIC  (Conversational  Language  foij^ 
Instruction  and  Compuxing),  is  tp  incorporate  an  interactive  in-  - 
stru'ctional  capability  within  a  large-scale^  general -purpose,,  time- 
shared  computer  system  ^the  Control  Data  Corporation  6600  computer 
andL  the  UT-2  operating  system  in  use  at  The  University  of  Texas 
Computation  Center),  in  a  fashion  that  makes  maximum  use  of  the  . 
system's  resources  Jboth  in  its  structure  and  application. 

Approach         .  .  '  ^ 

In  the  summer  of  1970, a  prpject  was  begun  at  The  University  of 
-Texas  to  provide  u^ers  af  a  general-purpose,  timersh^ring  System 
with"  the  capability  for  fexpXorii^g  conversational 'uses  of  computers 
in  instructre^i^  The  idea  for  the  CLIC  project  developed  from  the 
interest  of  largely /Computer  Sciences  Depirtment  and  Computation 
Center  faculty  anti  sVaff  member^.  .They  wished. to  develop  the  uSe* 

instructiojur  *conve!tsational  programs  on  the  CDC  6600.  and  to 

-  ' »  *        «  * 

investigate  the  benefits  of  CAI  in  the  classroom  and  laboratory 
using  the  existing  time-sharing  facilities^.and^oth(&r»Toutines 
within  the  sccJpe  of  the  machine.  ''Support  was  given  ty  Control  i}^ta 
.Corporation  for  .thd  project  development*        ,  '       .  , 

"   The, approach  taken  in  the  design  of  CLIC -was  for  the  most  pajrt  ^ 
>  technical  on^;    The  design  grouiJ  wa?  composed  of  computet  scien- * 
*  tists,  ^ot  behavioral. or  information  sci'ejiti^ts ;    The  group  had  only 
faint  knowledge  of  previous  CAI.  endeavors .    Local  individuals 


experienced  ip  the  field  were  contacted, and  their  suggestions  vyere 

incorporated  into  the  CLIC^systein.         '         .  ;  '  ' 

The  CtIC  systetn  in  its  initial  stages  gives  to  the  typiifal  CM 

user  the  capability  which  he  seemed  to  wan?  on  a  large,>  general-  . 

purpose  computet.  .  This  capability  mainly  consists  of  a  COURSE-. 

IjfRITER-like  (quisst  ion-answer),  logic  tliat  is  easily  learned  by  an**.. 

author;  it howeVer^  top  rigid  to  be  used  solely.    This  serves  ; 

oWy  as  ^  .pivot  point,  since,  the  CLIC  design  is  extremely  ropduljar 

>nd  the  addition, bf  other  logics  (sttategies)  can  trivially  be  in-- 

cbrp^rated.    The 'general  (skeleton-like)  structigre  that  is  provided 

.by  the  CLIC  system,  is  imperativi^  to  a  flexible  teaching  scheme- 

which. can  tryly  be  ^iynaraic  and  indi'viiiualized,     ^  _ 

*  •       '  •  '  .'^      *  .  - 

Simplicities  .Flexible  L^n^age  . 

*v^s  previous ly.-mefttioned,  the  CLIC 'language  bears  a  number  of.  , 
.sirailarities  to  the  y^'ell-known^  widely-used  COURSEWRITER  language. 
Due  to  this,. CLIC  appears  like  a  typical  CAt  system  to  an  inex-  * 
periented  s^uthor,  providing^ hirti  with,  an  extremely  simple  instruc- 
'tipnal  language.  ^ 

'  However,  for  an'  experienced  user  CLIC  differs  substantially 
fromvmS^st  previous  systems*.   An- author  who  feel?  restricted  by  the 
CLIC  language  ;^lways  possesses  the  option  of  dropping  into  Fortran 
whenever  he-  wishes/ 


) 

r  I 
V 

Significant  CLIC  Design  Features 

The  fact  that  CLIC  is  implemented  on  a  larg|e,  general -purpose 
computer  is  not  solely  a  significant  design  feature.    The  majority-  ^ 
^  of  the  previous  CAI  systems  that  have  been  designed  for  and  imple- 
mented on  similar  machines,  are  usually  rigid,  enormous  and  completely 
self-contained  instructional  systems, 

CLIC  is. not  a  self-contained  system.    It  is  the  use  that  CLIG 
makes  .of  the  machine's  potential  and  tht  degree  to  which  CLIC  is 
interfaced  with  the  computer,  that-^-identif ies  the  important  charac- 
teristics  of  its  design^  and  proposes  to  provide  a  powerful,  inte- 
grated and  flexible.  CAI  capability  to  the  CDC  6600  computer  system. 

Every  module  comprising  the  CLIC  system  utilizes  the*  existing 
software  whenever  practicable.  .The  major  faci;ities  it  makes  use 
of  are  briefly  commented  on  in  two  parts:    (1)  an  account  of  CLIC's 
use  of  the  system's  Fprtran  compiler  is  covered  in  the  discussion 
of  its  "Preprocessor"  feature,  while  (2)  the -remaining  system  lan- 
guage processors  it  uses  (i.e*,  available  editing  packages  and  the 
TAURUS  time-sharing  and  permanent  file  systems)  are  sketchily  dis-  ^ 
cussed  under  "Use  of  Available  Software.*' 

(1)  Preprocessor 

the  CLIC  system  incorporates^ the  feature  of 'a  ^'preprocessor", 

ft 

for  the  purpose  of  using  a  predefined  and  preoperational  computer 
language  and  its  respective  processing  system*.    The  preprocessor 
is  the  "CLIC  Translator"  and  the  language  exploited  is  Fortran. 


CLiC  language  <levtIop6ct  as  an  augt^ntatl^  and,r«fineatnt 

of  .the  CCAJRSEWRITER  language.    It  waSjiade  an  extarAlpii  (a  higher 
Idvel  language)  to  the  pprtp^h  language  by  affixing  a  ^'pteptoqessor'* 

•fr..^:;-  -   :  / ;     . .         \  *  .  ;      ^      ,      ^  \  ^\ 

^tHat  would  trari^la.t^e  CLIC  source^  progxaa^  to  Fortran  source  programs. 
Cl^C  .etoploys  tKe.  Rortjran  language  as-  its  "basic"  systtn  language 
(i.eV,  th^  langu^iiJd.of.  its.  ccSiu:;k^,progr|ap5)  and  titilizes  the  Fortran 
•  cdapil^er/'is  its  sys'tpsr  1^^^ 

6nqe;a  course  .fttsgtsjl^.  ^^ti^:  be^n 'Vcoi^^iied^'  (convtr'ttd  into  hi|5[hly 
optiijfized^atachijie.  tode),  ,the  course.  jC^rograjk  is  rt^^dy  for  student 
•ititeraction.s  Our         ^'tiidfM',  sbdsioti.  th^  i$  ^*exetuted^*. 

(prck:e5S^d)  b.y/t^e  '}ei.i(J -InStejcpretex^V^*^ 


V        Facilities  for  cour.Sd  coinpo$itii£)n  ^tid  rjeyision/,f or.  aonitpring 
."teiecoUBaunicationS,  and^  fox'.fjLXe  .toinipuU  Wt  been  ipQlud^d 

.S/^tJiin  %ht  CLIC  system.    When  an-aiit)iot  Creates  his  cpufse  hfe  . 


his  choice  of  ,usi^g  any  6^ !the.'avail^blf  editing  routines  to  build, 
•f^xaroine  and  modify,, course  program  file^    Wheti  a  User  converse$ 
.With-'a  course  via  CLIC-j  ht  util^izes  the  TAURUS/time-sharing  system 
whicli  takes  care'of  all  the  information  transmitted  during  the 
interaction*    kl\  the  necessary  file  maintenance i and  file  mani- 
pulation  is  autoftiatically  accomplished  by  CLIC  via  the  local  Permknent 
File  System.  .  ^     '  .  .        "  r 


Evuluation-        .  •  * 

A  general  but  relatively  accurate  aj\d  comprehensive  evaluation 
of  the  CLIG,  (feslgn  features  can  be  accomplished  by  listing  the  ' 
ad^i^ntagts  an(i  disadv^uitages  of  tjhe  fo'llowing  two^CLIC  system  charac- 
teristics:  ,   '  c^.   '\.  ^  ^'^    '      ^  -  ' 

Mse.of'a  large,\^eneral*-'pur|>d5^.,  timi-sharing  computer  system 
'*whi<:h' t^ke^  aa/intsge  of "  the -s)^*tem*-s,Gjftp^bilit'ie5.  • 

•   ay  ,  It  .iqak>$  i^-je^^^^ 

/  ,  u^6i^4  ,ipore  ;>xt^nsriye  s^'ftwafie''t,h^n;iw^  to '.them  on 

t..  \  It->'5:motey>fon^*mickii^^ 

'  •  sysiefD{  ybjbfpr^  'XiAl.  -cehi^fs^  si^nt  l&tge  '$yms  •  of^  iidne)^jto 

oht^iti  i  ;tl^iGa]tf^dr  :c6mpi|ter  :orf-*}tiHie-$hax       -listr^er-  system 
\  in  bloc^r-ofVeiOT^       time;-   TheiT  u^e'tisually.^ 

i      . ;    ;  costly^  •  ^'inqe  «;tHe)j  .BaVe  tq  accb Jfft '  for  -  {I)  \  ^heir  4vai>lab lei 

/  '  \     time.,  whic-Ji  i^;.^^  tlve.  corapui^p /!^idle^ 

,       ^      waitit^g /fp*' th6  user  C;i]Jsth'ei4:i'^^s6i^ 
\    ;      .  /  use  pf  spefef^i'  l/O"  devices,-  'Which '^a;sra'.rc>sulf  ;of  mbj^^  '  ;^  ,  .\ 
^;         ,\   coiiipiitet  l4ie  tlme,  ;are/,^^  \ 

c*  It  ^Ipw^  ^uth^oi^]  to ^ extract,  previously  written.  toiitiTies^- 
'     .        ftom  ^P.^er  system  ies^^  sit^deints^ ; 

,  ^  \.  '     alike^)  fpt  dpplica;ti<>n  in"his  courser.  '■SUch.,c6njnonVbu^ 
fop  exjmtple^  c^m^^t^^^ 
\     .         /iation,  infprm^tio^P^xetrievai, 'iy<>  p?^^^^^  si^eci^l 
\-  •  ?quipment^         ,      shbrti.  course*  authors -are  given  tho 
,         ,    potential  to  rebate  with  others  in  distinct  a^pplicatioa    -  ^ " 
a;rea^  and  to  sh^t^  auxiliary  routines. 
*d.     Since  CAI  time-Shares  with  some  of  the^ost  tecliriiically. 

adyanc^id  in  the  coiiputelr  field,  ^it  is  given -the  opportunity 
/  r     I'    and  is  ^hqwn  ^^h^  need,  t6  refine  its  interaction,- 

Wh^n  a  CM  s^ysteni  is  integrated  within  a  complex  operation 
^  V     system  and  linke4  to  Existing  software,,  its  capabilities 
"    \  are  iipmedij^ately  provided  and  promptly  improved  witji  a 

^      •      ; minimum  investment  in  system  programming  and  program 

^     documentatiqn*     .  .  -        .  • 

'd*  /  It  gi^es  the  regular  user  of  thii  sysitem  another  utility 
toutfne  that  Can  help  him  in  his  work*    Fot  example,^ a 
.   V^er  can  interrogate  CLIC  for  a  quick  progM^  review 
of  a  sut^ject  (w;thQut  leaving  hlis  terming  in  otder  to 
detect  his  error  wh0n  using  some  systeri/routine* 


g*    This  instructional  system  adds  another  application  area  to 
a  large  centralized  computer  system,  which  will  financially 
help  the  computer  center  in  its  continual  effort  to  provide 
overall  efficient,  reliable  computing  power. 

Disadvantage^: 

a*    Jhe  large  system  does  not  provide  the  wide  variety  of  I/O 
,  devices,  as  do  most  of ^the  smaller  machines  that  are  dedi- 
cated fo'.CAI  and  extended  specifically  for  use  in  education. 
/  Equipment  such  as  individual  JLIRT's,  slide  projectors,  re- 
corders, etc.,  is  extra  from  the  actual  computing  service 
used  by  CLIC. 

b.    The  author  who  writes  a  course  is  expected  to  know  a  little 
about  the  computer's  software.    He  must  be  knowledgeable  in 
the  basics  of  additional  language  processors,  such  as, 
TAURUS,  an  editing  routine,  the  Permanent  File  System, 
the  Fortran  languagd,  etc.) 

Use  of  a  preprocessor  to  define  the  basic  CLIC  system  language- 
to  be  Fortran. 

Advantages: 

a.  Provides  the  obvious  advantage  of  ease  of  codings  and  check- 
out. 

b.  :  Immediately  provides  full  Fortran  capability  with  little 

programmer  effort  and  extra  program  documentation;  these 
.  -'include  computational,  sequencing  (e.g. ,  GOTO's,  conditional 
and* unconditional  branching,  function"  ^nd  subroutine  features), 
logical,.  I/O,  formatting  facilities  and  access  to  ,a  large 
'  libxarx.  of  *f unctions:    .  . 
6.    Prevides  that  most,  thprough  linkage  to  routiriesli^nd  packages 
within  the  operating  system.    The  CLIC  system  is  automatically 
given  access '^to  the  system's  program  library;  system  I/O  and 
editing  packages,,  permanent  file  and  file  maintenance  rou^ 
tines^  special  packages  for  ddta  manipulation  (SLIP),  display 
routines  .(plotter,  film',  CRT,'  etc.'),  are  immediately  avail- 
able, to  CLIC'users. 

d,  Fortran  subroutilie  capability  makes  the  CLIC  system  and  its 
language  readily  extendible.    It  allows  for  easy  addition  to 
th,e  set  of  answer-processing  functions  that  enhance  the 

'  system,  and  to  the  set  of  auxiliary  functions  that  expands 
the'author  language,  thus  increasing  user. capability . 

e.  An  author  who  is  restricted  by  the  CLIC  language  may  inter-  ' 

■  sperse  actual  Fortran  .code  or  intermix  supplementary  routines 


with  his  CLIC  section,  o;t^  he  may  completely  replace  a  sec- 
tion with  all  Fortran  r(^utines.    In  short,  the  scoop  of  the 
resulting  language  for  tourse  programs  is  extremely  flexible; 
'       the  use  of  Fortran  provides  an  author  means  of  leaving  the 
higher  level  author  language  whenever  it  becomes  too  restric- 
tive . 

•f.    Compiled' Fortran  code  yields  more ^ efficient  data  for  the  CLIC 
driver  during  interaction,  than  interpretive  data.    It  pro- 
vides a  more  efficient -execution  (i.e.,  faster  response  time) 
of 'CLIC  programs  during  an  instructional  session,  since  the 
immediate  course  section  is  completely  in  core  and  only 
minimal  I/O  -  that  to  the  student's  terminal  -  is  required. 

g.  Fortran  is  relatively  machine  independent.    Source  code  for 
both  the  CLIC  system  and  its  courses  can  be  implemented  on 
any  suitable 'large,  general -purpose* computer;  courses  that 
are;iWritten  fbr  one.  computer  can  be  easily  recompiled  for 
another  which  contain?  CLIC.  •  . 

h.  The  author  is  able  to  modify  his  text  ^nd  prepare  alternative 
versions  with  relative  eatse* 


'Disadvantages: 

a.  '   Fortran  is  not  readily  adaptable  for  string  processing;  there 

fore,  no  real  string  processing  capabilities  are  present. 

b.  Preprocessing  usually  requires  more  time  in  a  system  which 
compiles  machine  code  'that  is  to  be  executed,  than  in  a  sys-' 
tem  which  does  little  to  course  data  files  that  are  to  be 
interpreted.    Even  minor  editing  of  a  CLIC  section  requires 
re-translation  and  re-compilation. 

c.  An  author  who  is  ri^stricted  by  the  CLIC  language  and  wants 
to  use  the,  potential  of  Fortran,  must  at  least  be  know- 
ledgeable in  basic  Fortran., 

di    In  core  execution  of  CLIC  programs  is  expensive *in  the.  ambient 
of  core  required.    Systems  that  interpreter  their  data  files, 
however,  need  only  that  core  required  by  the  interactxV^e 
system  since  their  data  is  stored  on  disc.  ^  ' 

e.  Clumsy  I/O  is  inhgjrent  in  Fortran;  formatting- information  - 
within  an  instructional  program  is  not  easily  accomplished 
if  an  author  is  restricted  by  the  CLIjC  language  and'  reverts 
to  Fortran. 

f.  Fortran  is  not  truly  machine  independent.     *  ^ 
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Current  trends  in  the  chemical  ijMustry  throughout  t^e  cc^untry  ' 
indicate    that  the  digital*c'omputer'\n|R.~aobn  become  an  indispensable 
part  of  chemical  experimentation  (l.>2^» 3.)  •    It  should  be'cvLeeJ*,  then,  ^ 
that  the  chemist  of  the- near  future,  will  be  required  to  understand 
and  use  properly  digital  instrumentation  and  digital  computers  in 
the  laboratory,    furthermore,  'educator^  at  colleger  ai^d  universities 
will  need  to  develop  undergraduate  curricula  providing  the  required 
backgEDWd  in  computeri/ed  chemical  iristrmnentation.^  In  fact,  the 
deraanu  for  this  kind  ot  training  by  practicing  scientists  has        ^  ; 
prompted  the  developm/nt  of  a  summer  short  cpurse  here  at  Purdue 


for  college  teache: 
sciences  (^)^ 


jth^s  in  chemistry  and  the  biomedical 


This  paper  desLriaes  the  Vi^*st  steps 'iA  a  long-range' program 
involving  the  introcrt4€!xiom  of  on-line  computer  applrcations  iij  the 
undergraduate  chemistry  laboratory  at  Purdue,    The  introduction 
has  been  made  as  part,  of  an  introductory  analytical  chemistry  course 
dealing  with  quantitative  chemiced  measurements  fjunior  level  at 
Purdue  )•    A  primary  objective  was  that  none  of  the  chemical*  and 
instrumental  principles  Currently  developed  in  the  coXirse  be  eli- 
minated, but  rather  that  these  be  augmented  by  the  incorporation 
of  the  on-line  ^'computer  into  many  of  the.  laboratory  experiments: 
The  manner  in  which  such  a  program  may  be  made  feasible  is  described 
below. 

APPROACH 

It  was  our  intention  to  introduce  computer  technology  into  the 
uij^ergraduate  chemistry  laboratory' without  slighting  the  chemistry' 
content •    The  approach  has  been  to  continue  to  include  the  same 
basic"  set  of  laboratory  experiments — excepi  that,  eventually,  many* 
will^  be  desipcned  to  require  on-line  communicat^ion  with,  the  digital 
/computer.    The  student  is  required  to  understand  and  execute  the 
laboratory  experiment  as  usual — but^,  \in  addition,  he  is  required 
to  desi^?:n  the  appropriate  programming  and  communication  el'ements  -?or 
optimum  interaqtion  between  the  computer,  experiment,  and  experi-  , 
nentalist.    'Thus,  we  envision  the  student's  achievement  to  be  qne  - 
of  experimental  design,  and  the  projpe-r  utilization  of  computing 
equipment*  in  the  solution  of  measurement  problems  m  ch^nistry* 


Specific  aspects  enphaslzcd  include:    the  general -purpose  natulre  of 
thc^'digital  computer  (i.g . ,  how  it  can  be  applied  to  a  large  variety 
of  experimental  jiroblems  vith  modifications  In  programming);  the 
computer  as  a  control  element  in  experimentation i  the  rapid  response 
bf  the  computer »  alloVip^  real-time  interaction  vith  jexperiments — 
vith  reaction  tim^s  sevfexjal  orders  of  magnitude  faster  than  the 
humsui  operator;  and  the  computer  as  an  integred  part  of  the  chemical 
ins  txniment  at  io  n . 

•f  *  *  *  ' 

To  accomplish  the  abpve  mentioned  objectives,  it  v«s  necessary 
to  recognize,  first  of  adi,  vhat  could  not  be  expected  within  the 
scope  .of  .this  course:    -(1)  we  could, not  educate,  pur  students, in  the 
intricacies  of  machine-language  programniing ,  and'  (2)  we  could  not 
pro*vide  [them  with  the  electronics  background  requisite  for  the 
sophisticated  deMgn  and  construction  of  interfacing  between  com- 
puter and  experiment.  ^ 

What  co\ild  be  done  is  to  ;;teLke  advantage  of  students'  previous 
exposure  to  ot^f-line  programnGlng  (with ^FORTRAN  usually),  which  many- 
but  not  all  sttidents — have  obtained  before  the  Junior  year  at  Pur- 
due •    We  could  also  take  advantage  of  their  exposure  to  the  Ifunda- 
mentals  of  electrical  measurements  in'*Physics  courses  to  instill 
them  with  the^modt  elementary  concepts  of  amplification,  response, 
ani  noise*    We  could  expect  to  pi>Qvide  them  vith  the  essential 
principles  of  digital  logic  rec^uilred  for  itfterface,  design. 

The  roost  imt>ortant  requirement  for  the  success  o^  the  program, 
hovevef,  vas  the  develapment  of  twp  items:-    (1)  a  modified  high-  • 
level  px5grammltig  language,  vhich  includes  versatile  data  acquisi- 
tion and  experimental  control  subroutines  vith  ^conversational  mod6  . 
calling  sequences.-'    (2)  A  general-puirpose  hardvare  package  for 
interface  design.    ^This  apparatus  would  pn)vide  forNpateli-^board 
incorporati(/n  of  those  digital  and  analog  modxiles  required  to.  com- 
plete the  interface  betveen  computer  and  experiment.   'The  general 
purposfe  interface  packagel  requires  only  that  the  student  i)€t  able 
to  lay  out  the  basic  logic  design,  timing  ^sequence ,  and  the  analog 
simplification  or  attenuation  required.    He  can  then  select  appra- 
,  priate  pre-packaged  plug-in  units  to  fntplement  hior  design.    He  does 
^  not  have  to  tend  to  such  details  as^l'evel  conversion,"  logic  con-?' 
version,  noise  rejection,  and^other  subtle  design  parameters-  Yet, 
•he  vill  accomplish  the  most  'important  part  of-.the  interCace  develop- 
ment-^logic  design. «  *  *        *  i 

PURbUE  REaL-TIME* BASIC  SYSTEM*'-' 

The  hifih-level  programming^  lan^!:uage  chosen  for  de;/elppraent  in 
this  vork  vas  "the  ^BASTC  lan^^uage  (5_) .    There  vere  several  reasons 
for  selecting  this  language:     (ll  It  is  easy  to  learn,  generally^ 
j^equiriAg'  the  order,  of  half  a  day  exposure  to  develop  eC^^od 

'        *        .         /     '  *       -       .  . 


,  •  I- '         •  . 

working  knowledge.    (2)  It  is  an  algebraically-oriented  conversational 
lahfcuage.    (3)  It  is  interactive.    That  is,  the  compiler  is  inter- 
pretive, and  therefore  compiles  and  executes  programs  lin^-by-line . 
This  allovr  programs  to  be  entered,  executed,  and  edited  on-line 
thru  a  teletype  terminal."    It  also  provides  for  immediate  turn- 
around and  rapid  er^or  diagnostics,    (k)  BASIC  is^  rapidly  becoming^-, 
a  universally,  acceptable  language.     (5)  BASIC  is  ^currently  readily 
available       comjner<iial  time-share  terminals.    %is  allows  the^ 
conven^ient -and  econoMceil  learning  of  the  liuiguage  b^  large  numbers 
of  §tiidents  for  later  on-line  experimental  .applications . 

•  It  should  be  noted  that  other  languages  (e.g.^  FOCAL*)  are- 
available  which  have  comparably  desirable  characteristics.  Even 
languages  such  as  FORTRAN  or  ALGOL  which  are  generally  conjsidered 
strictly  for  off-line  applications  may^  be  fippropi'iate,  particularly 
'for  systems  with  "load-«and-go"  capability./*  • 

The,sof-tware  system  developed  at  Purdue  w.ill^be  referred  to 
as.  "Purdue  Real-Time  Basifc/'  (PRTB)*    The  software  includes.,  fun- 
damentally, the  BASIC  compiler  available  from  Hewlett-Packard  Co., 
which  manufactured  the  computers 'used  in  the  undergraduate  lab- 
oratoly  progf^am  {see  Experimental  Section).    The  modifications 
generated  her?  at  PuiMue  have  involved  the  development  of  a  seri^' 
of  machine-language  sxibroutlnes  which  .are  directly  callable  from , 
the  BASIC  software!,  and  vhich  are  de&igned  to  communicate  in 
vari|5ty  of  ways  with  jexperimental  systems.    The  PRTB  data 'acquisi- 
tion and  contrpl  software  are  described  below". 

General -Description  of  PRTB  System^ 

Figure^ 1  illustrated  the  genered  nature  of  an  on-line  computer 
system.-  On  the  one  hand  is  located  the  laboratory  experiment,  the 
data  from^^ftiich  the  student  desires  to  present  to  the  computer  for 
data  processing.    On  the  other  hand  is  the  digital  computer  itself, 
which  15  capable  of  -  high^-speed  progrAmmed  computational  and  control 
functions.  '  In  order  to  operate  the  communication  link  between  the 
computer  and  the  experimental  aystem  an  electronic  interface  must 
)3e  establijjhed.    This  interface  accomplishes  the  functions  of  trans- 
lation (aftalog-to -digital,  digital-to-analog,  decoding,  logic  con- 
versions, etc.),  timing,  synchronization,  and  logical  control.  The 
progr'amming  language  must  take  into  account  ^the  nature  of  the 
electronic  interfacing;  and  the  interfacing  must 'be  designed  with 
the  i:Sindamental  characteristics  ctnd  (Capabilities  of  the  programming 
language  in  mind.    The  following  section  describes  the  sv^broutines 
developed  for  on-line  experimentation  within  the  PRTB  system.  A 
summary  of  these  subroutines  and  characteristics  is  given  in  Table  I. 


*Trade  Mark,  Digital  Equipment  Corp.,  ?^aynard,  Massachusetts. 
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ExperimentU.1  Input/Output  Sutrputlnes  for  PRTB/        »     *  '  . 

Data  Acquisition.    Two  different  types  of  datji  a^cquisition 
subroutine^  are  available  within  PRTB.^  They  bpth.  work  in  conjunc- 
tion with  ^n  external  clock-controlled  analog-to-di-gital  converter 

*  (ADC).    One  is  a  subroutine  (SB3)  which  should  belcalled  by  the  user's 
program  whenever  the  computer  should  be  waiting  for  the  rtext  d^a 
ppint  to  become  available.    When  the  next  datum  it  digitized,  ^B3 

.takes  the  lata  point  from  the  data  acquisition  device  (ADC),^  con-  ^ 
verts  the  latum  to  flo.ating  point  format,  and. stored  rl't  in- the 
appropriate  memory  lo6ation  for  subsequent  reference  l^^the  BASIC^'. 
program.    When  SB3  is  exited,  the  computer  returrl^  to  the  neat  state- 
ment in  the  BASIC  program  for  executio"^.    The  new  datum  caui  be  oper- 
ated on,  but,  before  the  n^xt  data  point  can  be  acquired,  SB3  must 
be  called/ aga.in.  ,      -  * 


A  second  type  of  data  acquisition  subroutine  (SB7)  is  one  which 
allows  for  the  acquisition  of  a  complete  block  of  data-? before  exit. 
The  external  clock  is  started  and  synchronized  Vith  data  acquisition 
within  the  subroutine.    The  basic  difference  between  SBT-mid  SB3  is^ 
that  SB3  is  called  to/acquire.  one  data  point  at  ^  time  and  therefore 
allows  for  program  statements  to  be  executed  during  the  time,,  between 
acquired  data  points.    Thus,  an  e^erimenter  co\ild  devise  a  ^program 
which  could  process  experimental  data  while  the 'experiment- va?  In 
progress  and  data  were  being  acquired/   This  is  referred  to  as  "real- 
.time"^  data  processing.    However,  because  the  computer  prqgpram  must 
Involve  the  relatively  inefficient  executioh  of  BASIC  staten^nts- 
between  data  points,  there  is  a  itbre  severe  limit  on  the  speed  with 
which  data  can  be  acquired  without  the  computer  getting  out  of  syn- 
chronization with  the  experimental  timing.    In  fact,  SB3  is  designed" 
t6  detect  vrtien  the  computer  has  gqtten  out  of  synchronization;  an 
error  message  will  be  typed,  and  the  computer  will  halt^    The  user 
will  then  have  to  revise  ^is  program  td  require  less  real-time  pro- 
'cess<tng.  •  With  SB7,  on  the  other  hand,  a*  complete  block  of  data  pbints 
are  acquired  before  the  subroutine  is  exited  and  control  returned  to 
BASIC.    Therefore,  the  -timing  is  limited  by  the  efficiency  of  the 
raachine-lanRuage  programming  developed  vithln  SB7.    For  our  comuuter 
systems  and^oftware,  it  is  possible  to  acquire  data  at  rates  as 
great  as  20  KHz  wit"^  SB7,  but  no  real-time  data  processing  is  possible. 

The  lifting  data  |-ate  when  using  SB3  is  about  500  H:s,  with  a 
mlnimm  of  real-time  data  handling.    If  several  BASIC  computations  k 
are  to  be  executed  between  data  points  when  using  SB!^  the  limiting 
data  acquisition  rate  may  be  the  order  of  1  to  10  Hz.    However,  this 
is  generally  more  than  adequate  for  most  experiments  in  the  chemistry 
laboratory.  v  \ 


Experimental  Control  and  Logic.  .  In  addition  to  data  acquisi- 
tion, there  are  several  other  way^  in" which  the  computer  can  commu- 
nicate with  the  experimental  system,.    One  of  thesefis  through  a 
digital^to-analOR  converter  (DAC^^!' where  the  digital  output  of*  the 
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couiputer  is  converted  by  the  DAQ  to  an  equivalent  analog  voltage 
level.    One  of  the  PRTB  subroutijnes  (SB9)  provides  the  capability 
for  driving  the  DAC,  the  output  of.^ich  can  be  connected  to.  exter- 
nal experiments.    It  is  possible  to.  generate-  a  continuoxis  voltage 
waveform  output  from  the  DAC  by  mathematical  generation  within  BASIC 
of  the  discrete  points  making  up  the  .waveform  and  transmitting  these 
throufji  3B9  to  the  DAC  in  a  repetitive  fashion  synchronized  with  an^ 
external  clock .  \  "  '  • .  , 

Subroutines  SBft  and.  GBU  allow  the  programmer 'to  utilize  Speci- 
fic output  bits  on  a  ^e^ected  I/O  , channel  to  control  eoctemal  devices. 
There  aire' l6  output  bits  available  for  this  function,   ^Wlth  SB8, 
the  user  can  select  which  bit  he  wants  to  set  by ^specifiitation  of 
the  bit  number.    The'  setting  of  one^  of  these  output  bits  causes  a 
corresponding  binary  voltage  .level  change  at  the  specified  output 
terjTiinal,  and'  this  can  be  used  to  clbse  or  open  switches,  start  or 
stop  experimental  events/,,  light  indicator  lamps,  etc.    SBll  ^allows 
the  programmer^ to  output  a  l6-bit  binary  voltage  pattern  with  any 
simultaneous- combination  of  "l"'s  and  "0"'s  he  choosee.    This  bi- 
nary pattern  is  selected  by,  including  in  th^  subroutine  call  the 
deqimal  equivalent  of  the  binary  number  to  be  generated^    Thus,  m&re 
than  one  event  can  be- -controlled  simultaneously. 

SB12  and  SBl's  provide  digital  input  information  for  program 
"sensinf^'^  of  external 'situations.    Thus,  l6  binary  voltage  input 
termincLLs  arej  etvailable  to  the  user,  "fihe  status  ol^  which  a.pe  acquired 
by  either  .subroutine.    SB12  transmits  to  tire  BASIC  program  the  ntzmer- 
ical  eqiiivalent  of  the  input  l6-bit  binary  Voltage^  pat^tern  Because 
the  status  of  these  bits  can  be  s'et  by  external  events,  the'  computer 
could  use  this  inf<>rmatiop  to  make  appropriate  changes  in  the  data 
processing  or  control  programmlh^g* 

•  .  S313  is  similar  to  SB12,  except'  that  only  the  status  of  a  single 
specified  input  bit  is  determiied.    This  subroutine  is  useful  to 
the  student 'because  he  need  only  specify  a  bit  number  in  the  call  . 
statement  to  check  the  status,  of*  a  bit. 

I  ,  ■ 

Jimlng  and  Synchronization.    iJhe  most  fupdamental  operation 
which  must  be  accomplished  by  the  PRTB  system  is.  the  generation  qf 
a  time  base  for  all  experim^tsl  functions.  .This  is  accomplished  by 
the  , incorporation  of  a  fiflred  frequency  crystal  clock  (lO  WIz)  into 
the  interface,  hardware.  •  Also  .ilncluded  is  electronic  coxintdc^  logic 
to  sQale  the  output  clock  pulses  down  to  a  usable  frequency  range 
fo^:  chemical  experimentat'ion  .(^00  KHz  to  0.01  Hz).    The  countdown 
logic  can  be. modified 'under  program  control  to  ©elect  any . frequency  . 
within  , this  which  can  be  generated  by  a  decade-^d/or  a  l,?,5-count- 
down  sequence.    The  J)rogrammed  clock  output*  pulse  train  is  then 
available  to  control  the  timing  on  the  ADC,  DAC,  or  any  other  exter- 
nkl  hardware.    AlBo  available  simultaneously  are  synchronous  clock 
^ul^es  representing^ frequencies  at  the  various  stages  of  countdown. 
(These  details  are  described  in  the  Experimental  Section.) 
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The  clock  is  contwHed  by  the  PRTB  doftvare  throu^  subroutines 
SBl,  SB2,  and  SBIO.    SBl  is  the  initiaaization  jBUbroUtlne.  Through 
It  the  programmer  specifies .  the  clock  frequency,  vithiq!  the.  limits 
outlined  above.     (SBl  is  also  used  to  specify  the  symbor  assigned'  /,  * 
to '  the  variable  vhich  vill  take  on  the  V8p.ues  of*  the  digital,  data  1 
acquired  in  SB3  or  SB7.)  ,     '       ^  ^ 

SB2  is  called  when  the  computer  program  decides  thtt  it  is  time 
to  enable  the  clock.    That  .is,  an  ENC0t)5  bit  is  set  -tea  "1"  atate  .  * 
on  the  data  acquisition  channel.    Th'is  Tjit  can  "be  jconnected  ^externally 
with  a  patchcord  to  the  ENABLE  terminal -on  the  clock  iriodule'  (see. 

^Figure  2-A).    Alternatively,,  the  EWCODE  bit  may  be  brou^t  to  'som<  '  _  ^ 
external  logic,  the  output  of  yhich  vill  set  the  ENABLE  clock  iftput 
Vheh  other  external  events  haVe  octurre^d—like*  "the  start  of'th^  ex- 
periment.   Ohe  possible  external  logic  configuration  i^  shdvrl  in 

.  Figure  2-B,  where  the.  ENCODE'  bit  conditibna  one  iijput  of  an  AND  ga?te. 
The  other  gate  input  is  conditioned  "TRUE"  vhen  the  expexlment  has*  x- 
been  initiated/  ^,The  AND  gite  output  vill  'then  enable  the  clock.' 
At  that  poipt  the  cou^it^ovn  lo*gic  vill  be  enabled  and^prOfi^rammed 
clock  pulses  will  begin  to  appear  At  the  available  outputs., 

^  \  *  '  .  '  ' 

SBIO  is^  called  to  disable  the  clock  vheh  the/ time  b^se  is  no 
longer, Squired.    It  sitoply  "clears"  the  ENCOPE  bit  on  the'd^ita 
acquisition 't:hannel  which  has  bien  used  externally  to  enable  the- 
clock,    •  r      •  ^        '  .  . 

'Another  t^pe  of  communication  required  betveen  the  e;^eriment  .  • 
'and  computer  is  associated  with  synchronia^ation  betveen  computer 
operations  ahd  ektemal  events.    For  "example,  ff*the  coB^ute^  has  ^ 
completed  all  preliminary  program  execution  required  before  being 
able  to  accept  experimental  4ata,  the  us'er  may  choose  to  receive  ax^  "/ 
output  f^om  the  computer    which  may  not  only  tell  -lihe  experiment  to 
start,  but  may  talso  initiate 'external  events  associated  with  the 
eOTerimental')57stem.    One  subroutine  available  for  this  kind  of 
operatiori  is  SB5.    5B5  causes  the  output  of  - an  ENCODE  bit  on  a  .  -  * 
selecte^  I/O  channel  to  cbtoge  sta^ie  (go  to  *the  ''TRUE**  lev6i)'vhen 
SB5v4s  culled.   -Then  the  computer  waits  within  SB5  for*  an  external 
event  tp  occur  which  will/cause  a  FLA<J' bit  on  the  same  I/O  channel 
to  change  to  a  TRUE -state.-  The^  subroutine  detect^  this  event  ajid  / 
then/^oliovs  the  nf^^^-geqnenti^  ^program  statement '  to.  be  executed.  " 
^This  h&Kt  statement  might  conceivably* be  a  call  to  SB2/vhicfi  initiates-* 
data  acquisition^  ^  .  *^  '  ' 

A  simpler,  but  less  precise ^^mekns  of  communicatioh  vith  the 
computer  for  the- purpose  of  syncijronization  is  provided^ by  SBU. 
When  this  subroaitine  is  entered  the  computer  simply  vaits  ^until  the  ' 
operator  flips  a  toggle  switch' corresponding  to  bit  0  an  the  cdmpil-' 
ter  console  svitph  register;    The  computer  then  exits  SBl^  and-exe*-. 
cutes  the  next  pro^^am  statement — vhich  might  be  to  call  SB2  and 
start  data  acquisition. 


FirwLLljr,- a^Vubroutipe  (SB6)  is  jivaila^le  which  allbvs  ge'neria- 
purpose  timing  functiofis.  ,This,  routine  simply  val*t8  for  the.  FLAG 
Mti  tg  b^  set*  on,  the  data  acquisition  channel,  clears  the  FLAG,  exits  - 
the  subroutir^e,  And  the  next  sequential  program  stetement  is  exe- 
cuted.   The  purpose  of  thig  subrbutine  is  synchronization  of  prb- 
gt'am  segi4entB.  yith  the  external*  time  bistitf.    (Ifote^  that  •S&5 'can  be 
u&ei;  also,  for  general-pvrpose  timing  functions  if  the  data  acqui- 
sition ch^inhel  is  needed  for  other  purposes.    The'  user  mu&t "'si«5)ly 
>conne&t  thtf  clpck  output' to  the  FLAG  input  of  tl^e  alternative  I/O 
'  channel . )  •  .    *  > 

;    •  "  *  •  ;   ^  ^  EXPERIMSNTAP   '  ^  ' 

Coratputer  Instrument  at  l^on.  '       .i-^      -  -  ^ 

The  digital  computer,  system  in  t^iis.vork  vas  a  Hewlett-Packard 
2115A,.  equipped  vlth  SKj^core- memory,  hi^  speed- paper. tape  input. 

Teletype,  a  l6^-bit  20  u^ec'JDAC,  and  an,  interfaced  TektrQnix  . 
Model  Box  o.sclllos'copic  -display.    In''ad^4ition,  the  computer  Has  >an 
interfaped  data  ac(iuisiti(5n  aystem  and  a  generaa^-rpurpose  experimental 
interface  capability.    (I^hese  are' describe'd  in  detail  belovO  .'The 
complete  conjmter  system  ip,  maunte'd  in  a  cabinet  which  has  roll- 
aroiind  capability*    The  *80ftwaire  ^^sed  for  laboratory  on-line  Operation 
'  (PRTB>  utilized  the 'H.P;'fiASlC  "compiler^  H^P,  #201112A.    This  soft- 
.w'are  is  jaade  available  from  t^e  co'mpvter manufacturer .  ^The  pi^o-gram^ 
listings  of  the  additioi>g.  to  the  BASIC  cdmpilar  made  ^here^tcT  .  . 

Implement  PRTB  arp  avallabie  from,  the^  authors  upon' rbquest>*. 
'  ^   ^   .     '' '  '    *    ,  ^       •  ' .  '  .       '  * 

Pat aT Acquisition  and  General-Purpose  Interface  Haa^dVare.    -  /  ^ 

.      .       • '  !       '  f 

schematic  diagram  of .  the  data.  acqCtisition  system  ^operated  in 

.  conjunction  with  PpT3  is  &hown  in.  Figvure^S.'  ,The^anal6g-to-digitai'  "  . 

converter  (ADC)  used  in  the  data  acquisition  system,  was  mknufACtured  ^ 

by^Digitai  Equipment  Gorpoi:ation  (DEC),  Maynard,'  Mass*  J^C#C-()02,. 

10-bit >  33  usee  co;iversion* time,  0  to'-10v23*V.  input  ranee): 

The  .programmable  cipck  is  constructed  from  a.  16  l-lHa-. crystal- ■ 
^c^troiie'd  oscillator  scaled  down  to  usable  frequency  ranges  with  -.  /* 
V£>\  taedivan-^clLle  integrated  circuit)  programmable  .coxmtdbwn  logic. 
A  schepi^itic  diagram  of  this  mo'dtile/is  giyen  in  ^igilre  'H,.  with  a:fcom- 
piiete  list  of  hardware  -cdAponents .    Programinfed  clock  pul?6&  Are       t  - 
available,  from*  100  KHz  to  .O^.OGSH^.r  Each  decade  frequency  cart  be  ' 
divided  by  2  or  5,,    The  specific,  frequency  is  selected  by  a  5-bit 
output. word  which  is  d?ecoded  in  the  clock  module  by  2  T^xfiis  Instru-  , 
ment  sijTUl^lj  ^Igittjl  mpitlpl^xei;s .    In  addition,  to  the  progr.ajmned, .  , 
.  output,  each  cWoade  output  frOmli.'MHz.  to  O.OOl  Hz  is:  available  ex-  *•  ' 
temally  by^patchboard  connection.    •  /  •         •     ,  . 

*i  .  *  *  '    *  '  '.',** 

'  Also  available  on  the-  dat.a- a^jquisition  panel  weM  .an  ENCODE 
output  bit  which  could  be  set  and  "oleared^by  the  computer,  and  a 
FLAG  innut  terminal  to  alloy  external  setting  o.f  the  FtAG  l?it  on- the 


computer  in^t/Output  (I/O)  cHey^inel  used  for  djita  acquisition • '  The 
end-of-conversion  (EX)C)  flip-floi>  of  the  ADC  was  normally  connected 
to  the  FLAG  input  i  howeVer,  it  was  pbssible'*  to  doonect  any  appropriate 
externally •  generated  signal  to  the  FLAG  input/   Six  bits  o*f  digital 
informa^trion  could  be  transferred  to  or  from  the  c6mputer  thru  the 
data  acquisition  panel  u^ng  patchboard  connectiofcs  on  the  p*anel*- 
Other  generally^  useful  functions  av^JLlaUle  on.  the  data  acquiEiltlbn 
panel  included  .patchboard  connection  to  vairiQus  logical  devices  such 
as  16  AND,' OR,  and  NOT  gates,  h  flip-flops,  2  one-^sho$s ,  X  toalog 
. switches,  U  relay  drivers,  a  5chmitt  trigger,  q,.  tr,ackii«^d-hold  ' 
amplifier,  6  indicator  ^.amps,  and  ?  push  button  switches  with  Schmit.t 
trigger  outputs,    (See  Ref,.  £t^or  a  general  discussion  of  characr 
*  teristics' of  control  logic  module^^.*^),  .DEC  R-se'ries  logic  (2;) -Flip- 
Chip  cards  -and  power  Supplies  were  used  for  logic  functions.    For  / 
interface  design  which  reqdiregL  more  SQphisttcntion  or,  more  logic 
elements  than^  cC)\Jld  be  obtained  on,  the  data  acquisition  panel,  a  DEC 
patchboard ^R-series^  L<?gic  Lab  {])  was  available*..  Most,  experiments; - 
did  'not  require  interface  hardyatte  ^yond  that  available  On  the  data 
acquiajftion  panel  •    All  input  and*  output  to  the  computer*  were  bUf-  r 
fered  with  level  conversibxii  devices  on'theH.P'.  2115A  system  «p  that' 
^1  external  connections  are  c^mpa^ible  with.  DEC  R-rseri.e^-  positive 
logic  (7)»    •DEC.W6.01  and  Wjld'level^  ton  version  cards:  were  \ised.' 

'  An  interfaced  ^glted'  l/O.motdule  was  .also  avedlabl^  for  general^-, 
purpose  digital  cpmiauiiicatio^i  between  computer  and'  experiment.    The,  , 
characteristics  of  this  module  tire  illustr^ed  i-n.  Fi^u^e  5*  This\''* 
digital  I/O  Module  was  'eonhected  tQ  a  ;dif  jferent  l/O  •cBauinel  than  the 
'data  acquisition  panfel.    Thus,  a  separate  independent  EIJqODE  output.  ' 
and  FLAG-  input  yere.  avail^able  *thni  this  mdd^le.'   The  .use  of  the 
digital  I/O  capAbili'ty  hasr  been  described  above*  -  ,  ' 

"  •    'RESUtTS  AND  JilSCUSSION' .  ^  * 

.EVa-luation  of  tHe  PRTB  ^gystem^       '  •  '   -     -  '  .  / -  - 

Foj;  the.  pixrpose  of  'iiiostrating  the  capabilities  of  "^the  PRT» 
software  fQ.r'this  publication,  the.  Is^bpratory 'computer  systems  and  , 
software  were  tested  y^h"  two.  synthetically  generated  experimental 
outputs  il^hich  simulated  typical  experimeptaj.  data  i^yQ)  *    'Aie  first ;  , 
of  these  involved  a  transient  experiment  with  a,  rapid  exponential  . 
yoltagj?'  decay  output^*    The  second  illustration  involved  application 
of  ensemble  averaging  to  the  analysis  of  an  experimental  output'./^ .  ^/ ' 
which  ' provided  a  ana^etitive  voltage  waveform  witH^superirapos^  laurge. 
amplitude  randoin^noise,  .     *        *  ^  ^ 

'  ^  "  .   *  \       '  -     ^         '  .  . 

The  ejtporitenHi^al  decay  waveform  had  a  time"  constant  tHe  order  j-' 
of  <J,1  sec*    Data  ^erd  taken  at  a  constant  rate  for  a  fixed  iength* 
of  time,  and  no  -pfocess^ing  of  the- ^at^a'Vas  carried  out*   .'JS^e  datk  '  • 
wer^f  acquired,  - stored,  and' the  digitized  data  printed  out  at  the 
'  completion,  of  the  experi'ment/   The  program  used  is  given  in  Figure  6l 


vayeform  foV  the  ehsemble  ayerA^ing' ifexperiment  had  a  fund^-... 
mental  tx^cjueney -the  ordftr  of  1^  Hz»^  with  superimposed  large  aflipli-  . 
*  t\x4e\rtffldom  noi$e.'  *.The  experiment  was  repeatable  and  coxild  be       '  ► 
triggex*e*d  hy  computeir  output.  "'The  pyogramminjg'  pb'Jectivea  itttluded  > 
repetitive  initiation  of  tH'e  e^erlrpent./ syn6hronizattijpti  of  data 
acqifclsitiort  ..with  e^p^j^imental  coatput,  and  repetitive  coherent-  suni- 
ma^tlon  of  l:he  jil git iMd  vaVe forms  to  accomplish 'ensemble  averfiiging- 
.Finally,  vheii  the  •e;egertni^ht  wais. completed' the.  averajgedi  dat^a  were 
nonngli zed. . aijd  plot1iie4\ohj- the  teletype  t  The  computer  pro-?  ... 

gr^xi  used  is  giVen  in-Ptgur^  l\    Figure  '6  shows  the  orlginetl  output 
wAveform' and  the  redults  of  ensemble  averaging. 

r 

-Observations.^  . 

It  can  be^  seen  from  the  above  examples  that  actuaut  implementa- 
tion, x^f  tlie  "PRTB  3ystert  is  relatively  straightforward.  Moreover,' 
for  all  its  simplicity:,  consi4erable  experimental  measurement  capa- 
V  .biiity  *>fe-*idt,s  *i«  the  system.^        ^  .  , 

The  Yequireinertts  for  implementing  the  PRTB  system  in  the  under-  » 
'^graduW^is  laboratory  include  the  following: 

{1}  an.  intro.d^ictloo^o;  the  BASIC  lahguage  for  computational  pmrppses.. 
'  Tfilfe*' require^  .  1  to  ^-Wurs  O'f  lecture' time.    It  euLso  requir.es  • 
\.tliat  ♦8tudenifl*have-accesr$  to  the  laboratorjr  computer  (or  other 
,        compxlter  facilities  providing,  MSIC  capability)  during  the  first 
fey-veeks  of  the  i<imester  to  handle  homework  and  laboratory  com- 
*    ..'^putja^t^on^s:  *  k  .v.;.  -  _ '  *  ^  , 

>^^r2)\€m  introduction  .'to  .the  futidftrtenta^  conc^ptsof  on^-line  compjiter' 
\  ,  opera'tiori,  ^jj^is  i^quir^a^* about ^^^^>h6TS*e  of  lecture.  ^  These  intro^ 
iiuce-We  student -to''i>h^  whole  field' of  jbjarJLine  computer  appli- 
ca£ipn%  diha.the  tisehnciloi^ical  H^tillj^ith  which  they  must  be 
,  .^.ffi^iliur  to"" implement  this- approsSTin  the  laboritory:    Ri^se  "  ^ 
/details  include  an  i^^rroduction  to- the  fundamenta:ls  o.f.  digital 
logic-j  ^t^iinir^g,  and  ^yn-cfiroinization.    This  Involves  the  dispussion 
:  ^  ^  of /3impl^'^gat6s^'^AI^D",^<^^  the  flip-flop,  the /me -shot; 

.  arid  the  analog-switch.    The  function)^  icharact^^ilisttcs  of  these 
devices  >  *a*&  ak  'x^f  aftalog-^o-rdigital  ahd  digital-to-aJialog 

pon'verter^  and  yoltag6  arapli-fitfrs.,  for  interface  defg^gn  are 
\.'[  ^,  de]firred.        attempt  is  fiiade.  to  provide  ^  'rigorbuis  und.er.stabdltig 
of  the  e3.eci;rpn.ic  principled -or  d^^teiiled/crfi'cuil^jpjj.-for  these 
^  devl^e^V*        '^'"Ph.^^i^      .9^?        student  being /^a^jlV  .recognize, 
emd^-use  the.  fundelln6ntAl''cliaracterid.tic's^  ojf,  these  difeVlces  f^or  *  ' 
inter f'acfe  (Jesi'gnn..,.-  '        '.^        /' ^  ^  / 

'.(3)  finally,  th^  Actual  l^b^ratory  assignment^  Aire  glifried  out..^-- 
^    These  -e^ssignmsnts'  are -design'ed.ld  .iirtV^bd'uce^       student  to  thfe  . 
.^alyt'ical  technique and  linethodolpgy  ^  ju*&t  J^s' votdd  be  don^-'ih. 


the  absence  of  the  laboratory  computer  instrtunent^tion.  In 
addition,  the  student  is  expected  to  utilize  the  on-line  com- 
puter as  a  data  acquisition  and  d^ta  processing  tool. 

The  final  question  to  be  considered  here  is  what  specific  bene- 
fits are  derived  by  the  student  from  the  use  of  the  laboratory  com- 
puter in  on-line  experimentation.    First  of  all,  the  student  is 
obviously  exposed  to^  state-of-the  art  technology  in  laboratory  ex- 
perimentation.   Secondly,  the  student  is  encouraged  to  luse  more 
rigorous  data  processing  approaches  theui  previously  .feasible  for 
laboratory  assignments.    Thirdly,  the  student  becomes  keenly  aware 
of  the  factors  vhich  limit  and  define  experimen^;al  accuracy  and 
precision.    Fourthly,  the  student  is  able  to  make  ^  first  hand  com- 
parison of  the  effectiveness  of  conventional  6uid  computerized  lab- 
oratory methodology.    And,  finally,  the  student's  interest  in  the 
science  of  queuititatiye  chemical  experimentation  is, strongly  stimu- 
lated. 

Future  Work. 

Severed  different  Experiments  involving,  on-line  computer  studies 
are  cx^rrently.  be,ing  developed  which  are  appropriate  for  incorporation 
into  'the  undergraduWe   -anklytical  laboratory^.    The^e  include  kinetic 
methods  of  analysis ,  ^araperometric  titj^ations,  couiometric  fitrati-ons,  - 
.  iwte^tiometric    titrations,  spectrophotdmetric  anatLysis,  fast-sweep 
.  polarographic  emalysis and  thermal  aneGlysis  methods,  in  addition 
to  synthetic'  and  ^Trromatogrkphlc  experiments  already  developed  ( 8.,2.)  • 
*"9bme  of  these  experjlntentSv  will  be  included  in  the  introdu?;tbry  course 
and  some  in  an  advanced  analytical  8ours^  (s^eriior  level  at  Purdue). 

It  is  aiso  desired  to  further  improve  the  "PRTB  software  so  that 
the  experimental  subroutine  calls  can  be  replaced  by  'conversational 
macro-instructibns .       -    ,  '       ^  , 
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TABLE  li  , 

DATA  ACQUISITION  AND  CONTROL  SUBROUTINES  FOR  PRTB 

SUBROUTINE  FUNCTION  CALL  FOR?>tAT 

SBl  Initialize  I/O.    Set  up  data  storage       CALL  (l,X(l),F) 

address  and  acquisition  rate  for  other 
subroutines •    "X(l)"  data  variable. 
"1^"  =  data  acquisition  frequency* 

•  f 

SB2  Used  to  sta^t  the  CLOCK •    The  output'      CALL  (2) 

control  bit  (ENCODEKcan  be  connected 
to  some  external  logic  preceding  the 
CLOCK  ENABLE  input.     (NOTE:     SB2  must 
be  preceded  by  SBl.)      -  - 

SB3  Waits  for  ADC  FLAG,  indicating  con-         CMiL  (3) 

version  completed.    One  datum  is 
thfn  taken  from  the  ADC,  converted     '  - 
to  floating  point,  8uid  saved  as  X(l), 

SB^i  When  called  this  subroutine  va^ts  for     CALL  (U) 

switch  0  of  the  console  svitch  regis- 
ter  to  be  set  to  a  "1    before  con- 
tinuing with  the  next  statement  in 
the  BASIC  program. 

SB5*  ,  Outputs  one  control  bit  (ENCODE)  and       CALL  (5,C) 

then  waits  for  an  event  FLAG  on  the 
specified  I/O  channel,  "C",  before 
continuing.    FLAG  and  ENCODE  are 
\    cleared  before  exit. 


SB6  -       Waits  for  FLAGS"  on  the  ADC  channel./        CALL  (6) 

When  FLAG  is  set,  the  next  statement 
^  4rj  the  BASIC  program  is  executed.  •  ' 

(Mufi(t  be  preceded  by  SBl  and  SB2  if 
internet  CLOCK  is  used  to' generate 
FLAGS.) 

SB?  Takes  in  complete  block  of  data  •    CALL  (7,T) 

before  returning  to  BASIC  program. 
(Max.  of  2$b  pts.    Must  be  preceded 
by  SBl  each  time  it  is  called.) 
"T"^sets  total  No.  of  data  pts. 
tCLken.    Synchronized  with  start  erf  , 
experiment  through  external  gating 
of  the  ENCODE  output.    Up  to  2C  KHz 
data  acquisition  rate  possible. 


SUiBROUTINE 

\  , 

\ 

SB9 
SBIO 


SBll 


SB12 


SB13 


f 
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FUNCTION 


''Z"  to  be  set 


Causes  output  bit  No 
TRUE  on  specified  I/O  channel,  "C" 
Z,  C  specified  in  decimal.  All 
other  bits  will  be  cleared. 


Causes  output  of  analog  voltage 
throxigh  the  DAC 


^'D"  -  rav.  outp\rt* 


Clears  ADC  channel%  'External  Flip- 
flop  driven  by  ENCODE  will  be 
cleared  aiLso.    This  can  be  used  to 
turn  off  clock,  stop  experiment,  etc, 

Causes^  output  of  binary  voltage  pat- 
tern, "Z",  on  specified  output 
channel,  "C";    Binary  pattern  and 
channel  specified  in  decimal. 

Inputs  status  of  l6-bit  register  on 
channel  "C".    The  floating  point  , 
equi valient  is  saved  in  memory  as  "Z", 

Inputs  status  of .a  specific  bit 
(1^0.  "Z")  on  channel  "C".    Va^ue  of 
"S"  is  uade  "l"  or  "0"  accordingly. 


CALL  FORMAT 
CALL  (8,Z,C) 

CALL  (9,D) 
CALL  (10) 

CALL  (11,Z^C) 

CALL  (12,Z,C)' 
CALL  (1'3>Z,S^C) 
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Figure  !•    Schematic  Diagram  of  On-Line  Computer  Configuj-a.tion» 

Figure  2*    Synchronization  of  Clock  Output^ vltH  Computer  and  Experi- 
mental Events . 

A,    Direct  computer  control  of  Clock  Enable  input*  , 
Clock  Enable  controlled  by  AND  gate  output.  Clock 
enabled  when  both  computer  command  and  experimental 
start  are  seen  at  AND  gate  inputs*    The  "RUN"  flip- 
floi$' output  *  follows' the  status  of  the  experiment^; 
it  iff  set  "true"  when  the  experiment  starts  and 
"Fal^e"  when  the  experiment  ends.    Thus,  the  Clock 
is  disabled  when  the  experiment  ends. 

Figure  3.    TypicfiLL  PRTB  Experimental  Data  Acquisition  Set-up. 

Previously  undefined  symbols  include; 

I/O  Bus  -  Input/Output  hardware  (jf  the  coraputer,  one 

channel  of  which  is  used  for  data  acquisition 

system. 

l6-bit  duplex  register  -  A  Hewlett-Packard  #021l6-6l95 
l6-bit  duplex  interface  buffer  register  card, 
also  providing  output  ENCODE  command  and  input 
^  FLAG  signal. 

Figure        Schematic  Diagram  for  PRTB  Programmable  Clock. 

Components:  ^ 


Device  No, 


Manufacturer 


Description 


F910959 


Fairchild 


MCTi^T5P 
SN7i^l51 

F9 30059 

1? 

F900359 


Nfotorola 

Texas  .Instrument 

Fairchild 

Fairchild 


Hex  inverter- 
six  Level  Con- 
verters, High 
Level  to  -^h  to 
+5/0  V. 

Qua\i  latch-r^ 
bistable  latches 

Data  Selector/ 
Multiplexer 

li~bit  shift 
register 

Triple  3-input 
Nand  Gate 


Figure  5*    Digital  l/O  Interface  Module  for  PRTB, 
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Figure  6»    Program  for  Data  Acquisition  from  Exponential  Decay- 
Function.* 

Figure        Program  for  Ensemble  Averaging  Experiment. 

Figwe  8.    Data  fi^m  Ensemble  Averaging  Experiment »  plotted  on 
Teletype . 

A.    Original  data  (single  cycle). 
B»    Data  after  100  averaging  cycles. 
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READY 
LIST 

10  REM  THIS  IS  A  PROGRAM  THAT  WILL  PRINT  OUT  THE  DECIMAL  HUMBER 

20  REM  EQUIVALENT  OF  AN  ANALOG  EXPONENTIA|i  DECAY  CURVE. 

30  DIM  X(250) 

U0  LET  F=»100 

50  LET  T«250 

60  CALL  (1,X(1),F). 

70  CALL  (T,T) 

80  FOR  I»l   'TO  250 

90  PRINT  X(I);  ^ 

100  NEXT  I  ■  .  . 

110  END  '  '  ■  '  ' 


FEA5Y     '  • 
LIST 

1  riEM  THIS  PROG^AW  IS  AN  ENSEMBU  AVERAGING  ROUTINE— FOLLOW 

2  EEH  THE  DIRECTIONS  PRINTED  OUT  AFTER  PRESSING  RUN.  THE 

3  ReS-  FREQUENCY  ORIGINALLY  SPECIFIED.  IS  1000  HZ. 
10  DIM  X(100),Y(100) 

20  LET  ,1^1000 

30  PRINt  "IHE  #  OF  PTS  EACH  RUN 

31  INPUT  T/ 

40  PRINT  TkE  f  OF  PUNS 
•41  INPUT  t 
■  50  MAT  Y=ZER 
60  FOR  1  =  1   TO  C 
70  CALL(l,X(n,r) 
B0'CALL(7,T3 
90  FOP  J=l   TO  T 
im  LET  Y(  J)  =  YCJ)+X(,J) 
110  NEXT  J  , 

120  fl&XT  I  '  .  ■ 

20fl  FOP  K=l   TO  T 

2fl0  LET  E=Y(  K)/CC*10) 

220  IF  E>7?.  THEN  300 

?.30  PFINT  TAn(E){  * 

240   ^)EXT  K 

250  STOP  * 
300  LET  E=71 

310  GOTO  230  ■  ' 

^00  END  « . 
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COrWTER- BASED  TEACHING  OF  ORGANIC  CHEMISTRY 

Stanley  Smith  ^  ^ 

'Department  of  Chemistry 
University  of  Illinois %t 
Urbana,  Illinois  618OI 


Increased  demands  for  high  quality  chemistry  courses  for 
students  with  diverse  backgrounds  and  objectives  provide  a 
challenge  to  the  development  of  innovative  and  more  effective 
teaching^  methods . 

A  tutorial  procedure,  in  which  each  student  has  the  complete 
attention  of  the  professor,  would  meet  the  educational  needs,  but 
the  many  students  involved  and  the  high  cost  of  this  approach 
prohibit  extensive  use  of  this  method.    However,  the  development 
of  computer-based  educational  systems  makes  it  possible  to 
appr9ximate  closely  the  one-to-one  correspondence  between  a 
professor  and  a  student  and  offers  techniques  for  providing  many 
students  with  essentially  individual  attention  at  acceptable  cost 
levels.    One  of  these  systems,  PLATO  (Programmed  Logic  for 
Automatic  Teaching  Operations),  developed  at  the  University  of 
Illinois,  has  been  programmed  to  present  various  aspects  of  under- 
graduate icourses  in  organic  chemistry.  ^Unlike  films,  television 
lectures,  and,  all  too  often,  live  lectures,  PLATO  is  not  a  passive 
learning  aid.    The  student  is  directly  involved.    His  decisions  and 
his  actions  control  his  progress. 

In  the  present  PLATO  system,  each  of  twenty  student  terminals 
is  equipped  with  a  television  screen  ana  a  keyset,  which  closely 
resembles  a  typewriter  keyboard.    Informtion  displayed  on  the 
screen  may  consist  of  words,  number^,  formulas,  graphs,  or  diagrams 
drawn  by  the  computer  or  images  selected  by  the  computer  from  a  set 
of  slides.    By  t)ae  superpositioning  of  computer  plotting  and  slides, 
complex  structures, .formulas,  and  diagrams  can  be  easily  presented 
to  the  student.    In  turn,  through  the  keyset,  the  student  asks  and 
responds  to  questions,  supplies  parameters  for  calculations  to  be 
performed  by  the  computer,  and  suggests  reagents  for  chemical 
reactions  which  the  computer  is  to  illustrate.    Since  the  computer 
can  accept  and  respond  to  chemical  terminology,  it  is  unnecessary 
for  students  to  be  acquainted  with  computer  programming  techniques. 

Verif icatioi\  of  a  correct  response  or  the  correction  of  an 
error  requires  less  than  a  second.     If  the  instructor  wishes,  the 
student's  procrccG  throu^^h  'he  lesson  material  may  be  controlled  by 
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the  number  of  errors  he  makes.     If  the  student  has  difficulty, 
extra  help  can  be  pro\rided  automatically.    A  printed  record  of  each 
student's  work  is  provided  for  the  instructor  by  the  computer, 
making  it  possible  for  the  progress  of  any  individual  to  be 
reviewed  in  detail. 

A  sumiaary  of  the  lessons  currently  in  use  is  given  in  Table  1. 
These  programs  combine  tutorial  techniques,  drill,  and  simulated 
experiments  to  develop  chemical  principles. 


Table  1.    Summary  of  Organic  Chemistry  PLATO  Lesson^ 

1.  Introduction  to  organic  structures  and  nomenclature. 

2.  Structure,  nomenclature,  and  reactions  of  alkenes. 
5.    Properties,  nomenclature,  and  reactions  of  alcohols. 

Aldehyde  and  ketone  chemistry.  ,  . 

5.  Introduction  to  aliphatic  synthesis. 

6.  Mult i step,  aliphatic  synthesis. 

?•    Introduction  to  electrophilic  aromatic  substitution. 

8.  Multistep  electrophilic  aromatic  substitution 
reactions. 

9.  Introduction  to  nmr  spectroscopy,  chemical  shifts, 
spin- spin  coupling,  and  interpretation  of  the  proton 
nmr  spectra  of  organic  compounds. 

10.  Carbohydrate  chemistry.  . 

11.  Simple  and  fractional  distillation. 

12.  Qualitative  organic  analysis.'  Identification  of 
unknQvm  compounds  by  combination  of  spectroscopic 
and  chemical  techniques. 

Of  particular  interest  is  the  use  of  simulated  experiments.  In 
textbooks  and  lectures,  the  results  of  important  experiments  are 
usually  given  as  facts  to  be  learned.    Although  it  is  only  rarely 
possible  to  allow  those  in  a  large  class  to  do  these  experiments, 
the  computer  can  be .progranmed  so  that  each  student  may  conduct 
simulated  experiments. 

The  general  concept  used  in  these  simulations  is  to  allow  the 
student  to  ask  the  computer  to  perform  a  series  of  experiments  and 
report  the  results.    When  the  student  is  satisfied  that  he  has 
devised  the  correct  experiment  and  understands .the  results,  he  may 
proceed  with  the  program.    Of  Qoi^se,  he  is  immediately  asked 
questions  which  can  b^- answered  if  the  correct  experiments  have  been 
done.    If  the  student  cannot  answer  the  questions,  the  program 
indicates  the  type  of  experiments  he  should  investigate  and  directs 
him  to  that  part  of  the  lesson.     Tt  iS  hoped  that  this  procedure  will 


encourage  and  reward  learning  by  discovery  and  analysis,  thus 
providing  a  depth  of  understanding  which  only  rarely  results  from 
learning  by  rote-- 

The  bromination  of  ketones  is  used  here  to  illustrate  this 
approach  to  learning*    Figures  1-5  are  photographs  of  the  television 
display  on  a  student  terminal-    Figure  1  shows  the  screen  before  any 
experiments  have  been  run.    In  previous  frames  the  student  has  been 
advised  to  selexit  values  for  the  concentrations' of  the  two 'reao^nts, 
sodium  hydroxide  and  bromine.    In  Figure  2,  three  student- selected 
experiments  have  been  recorded  as  graphs  of  ^  yield  of  product  vs. 
time.    The  first  experiment:  is  plotted  as  a  series  of  ones,  the 
second  as  twos,  etc.    It  should  be  noted  that  experiments  2  and  5 
give  exactly  the  same  curve.    The  student  may  perform  as  many 
experiments  as  he  wishes,  each  expe;riment  being  completed  in  about 
one  second.    When  the  student  chooses  to  go  on,  the  question  given 
in  Figure  5  appears.    Here  he  musl  indicate  how  each  of  the  reactants 
affects  the  rate  of  formation  of  the  product  of  the  reaction.  When 
an  incorrect  einswer  is  given,  the  computer  automatically  suggests 
further  experiments  and  returns  the  student  to  the  display  shown  in 
Figure  1.    It  is  important  to  note  that  no  clues  are  provided  by 
the  format  of  the  questions.    The  computer  has  been  programmed, 
through  key  word  judging,  to  analyze  the  validity  of  the  student's 
responses. 

Development  of  the  facility  to  outline  suitable  schemes  for 
the  synthesis  of  compounds  is  also  an  important  componenl  of  intro- 
ductory courses  in  organic  dhemistry.    Even  relatively  sitople 
molecxiles  can  provide  a  stimulating  intellectual  challenge  to 
beginning  studentfe.    However,  to  encourage  imaginative  approaches, 
it  is  important  to  allow  each  individual  wide  freedom  in  the  choice 
of  starting  materials  and  reaqtants.  .In  addition,  rapid  evaluation 
of  the  correctness  of  a  proposed  synthetic  scheme  promotes  interest 
and  confidence.  • 

Algorithms  have  been, devised  which  permit  the  PLATO  computer 
system  to  judge  the  correctness  of^a  propose^  synthesis  "by  computing 
the  structure  of  the  product  of  each  step  iSTthe  reaction, 
displaying  the  structure  on  the  student »s  screen,  and  comparing 
the  structure  with  that  of  the  desired  final  product.  This 
approach  permits  a  student  to  verify  quickly  the  correctness  of 
a  proposed  synthetic  scheme  even  though  it  may  be  quite  different 
from  that  outlined  in  the  textbook  or  lectures.' 

The  start  of  a  typical  program,  at  the  first  semester  organic 
chemistr:,^  level,  is  illustrated  in  Figure^U*    The  student  is  first 
asked  to  indicate  the  starting  material  of  his  choice.  Either 
formulas,  common  names,  or  TUPAC  name^s  may  be  used.    The.  require- 
ment of  three  carbons  or  less  in  the  starting  material  was  imposed 
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BTomln8l!on  of  CgH5-C-C-CH2CH3     (0.01  ft) 

H  Concentrations 
100  NaO.H  Br^ 
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TIm*  Mlnutts 

Figure  1.    Photograph  of  PLATO  screen.    To. study  factors  affecting 
rate  of  reaction,  the  student  selects  various  concen- 
trations of  NaOH  and  Br^. 
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Figure  2.    The  results  of  three' experiments  plotted  as  yield  of     ,  * 
,  product  vs;.  time  are  displayed  on  the  screen.  E3q)'erimenlis 
•*      '        '2  and  3  gi Ve  the  same ,  graph .  ^  /       ;  '  ,      . '  . 


What  happens  to  the  rate  of  bromjnatfon  of 
ketones  In  basic  solution  when  the 
toncentratloh  of 

(a)   NeOH  Is  Increased  ^  , 

# 

t  - 

ft>)  liKreas^d  ^ 


Figure  5.    If  the  correct  experiments  have  been  done,  Figures  1  and 
»   2,  ^these  questions  can  be  ansWered.    Key  wd>rd  judging  is 
«  used  by  the'' computer  to  recognize  a  correct  response. 


Type  iKe  neme  or  formula  of  the  star  tine 
Mterla)  (3-earbons  or  less) . 


Stail!T«  naiWial :  « 


.  0 
Ffi»l  Product:  || 

CH3CHJCCH2CH3 


Tor  help  press  HELP. 


Figure  U.    Start  of  aliphatic  synthesis.    The  student  must  indicate 
•    the  starting  material  of  his  choice. 
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because  of  the  desire,  for 'petiagogical  reasons,  •to  restrict  these 
particular  problems  to*  compounds  with  relatively  few  carbon  atoms 
in  order  to  fgcus  attention, on  the  chemical  reactions  involved. 
At  present,  55  diffei*ent  starting  materials  are  recognized,  each 
by.  severeil  names. 

The  computer  responds  tcS"  a' suggested  starting  material  by 
drawing  the  structureil  fonmJja  and  requesting  that  the  student  type 
the  name  or  formula  of  the  reagents  to  be  used  in  the  first  ste^ 
of  tlie  synthesis. 

In, Figure  5,  propionaldehyde  has  been  selected  as  the  starting 
materiaCl'  and  in  Figure  6^the  i^agent  for  the  first  step  is  to  be 
indicated.    Any  of  l^^i  reagents,  whicl\can  be  either  organic  or 
inorganic,  may  be  used  in  each  step..  "Again,  t;p  promote  ease  of  use 
each  reagent  is  recognized  by  several  names.    Additioneil  synbnyms 
or  chemically  similai;  reagents  may  be  incorporated  into  the  lesson 
in  about  one  minute,  even  while  a^class  is  using  the  program. 

The  product  of  the  reaction  automatically  becomes  the  starting 
material  for  the  next  step;  until  the  synthesis  is  complete. 
Figure  7.    Should  a  given  scheme  not  l^ead  to  the  product,  it  is 
possible  to  start  over  at  any  stag^  by  pressing  a  single  key. 

Since  there  ar6  many  acceptable*  solutions  to  each  problem, 
upon  completion  of  ^  synthesis,  the  student  is  asked  whether  he 
wants  to  work  the  problem*  again  or  try ''a  different  problem. 

Type  the  name  or  formula  of  the  starting 
*        Mierifil  (3  carbons  or  less)  . 

StdTlIng  flaierlai :  *  proplon&ldehyde 


0 

Flrel  Product:  ;  || 

CHjCH^CCHjCHj 


Tor  help  press  HELP. 


Figure  5.    Propionaldehyde  has',b?en  selected  as  the  starting  point 
for  the  proposed  syi^theSis  of  5-pentanone. 
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CHjCH^CH 
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CH3CH2CCH2CH3 


.     •      To  start  again  press  OfiTfl.  ^ 

Figure  6.  .PLATO  has  drawn  the  structural  formula  of  Ihe  selected 
starting  material.    The-  reagent  for  the  first  step  in 
the  synthesis  is  iv>w  to  be  indicated. 


CH3CH2CH  ' 

H30;  cr^ 
0 

If 

CH3CH2CCH2CH3 


Jtie  synthesSs  is  com^>leted. 

To vprk  THIS  problem  ago! n  press  DATA 
Jo  go  on  press  NEXT   ,  =»  unjn. 

Figure  7.   .Summary  of  the  completed  sjrnthe'sis  of  3-pentanone. 
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Syntheses  involving  electrophilic?  aromatic  substitution  have 
been  progranmed  in  a  similar  way,  with  the  computer  again  sh6wing 
the  product  at, each  stage  of  the  student *s  synthesis)  aiid;  where 
several  isomers  are  formed,  e^.g. ,  ortho-meta-para  mixture,  the  ^ 
composition  is  computed  from  appropriate  linear  free  efiergy' 
relationships  and  displayed  on  the  screen  so  ttiat  the  student,  can 
.develop  a  general  awareness  of-^somer  distribution  in  these 
reactions. 

The  current  introduction  of  the  PLATO  IV  system,  which_) 
utilizes  plasma  panels,  will  make  it  possible  for  up  to  U,000 
terminals  to  be  operated  from  a  central  ccraputer. 
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Compu-^er-Assisted."  Instruct ij^n  in  Chemistry  at  Pitt 


E.  E.  Tratras  and  K.  J.  Johnson 
Department  o^  Chemistry 
University  of  "Pittsburgh  . 
'Pittsburgh^  Penna.  15215 


Sixteen  tutorial-drill  CAI  programs  were  made  available  to 
freshman  chemistry  students  diiring  the  fall  term  of  1970.  They 
have  been  implemented  on  an  IBM  560/5O  computer,  using  the  Pitt 
Time-Shying  System  (PTSS),    PTSS  supports  the  CAI  language  CATALYST/ 
PIL.    CATALYST  (Computer  Assisted  Teaching  and  Learning  System)  is 
a  lesson-designer  oriented  language  developed  at  Pitt,    PIL  (Pitt's  • 
Interpretive  Language")  is  a  JOSS  -  derived  conyersational  language.  , 
The  CATALYST/PIL  interface  provides  a  nearly  ideal  CAI  language  for 
both  lesson  designers  eind  students. 

Arithmetic  FeatiU'es 

One  of  the  most  attractive,  fea'&ures  is  the  ease  wi^th  which 
students  can  make  chemical  calculations.     In  addition^ to  the  stand- 
ard verbal  and  numeric  answers,  CATALYST/PIL  will  accept  ari'thmetic 
expressions.    For  example  in  an  ideal  gas  law  problem  the  answer 
might  be  typed  as  follows:  «  *  ^ 

>785*2.5*(50  +  27^)/(275^'60*.75)  . 

iswer  to  an  equilibrium  hydrogen  ion^oncentration  problemN, 
^e  given  (approximately)  by  * 

>{0.l85*1.8e-5)**(.5) 


The  answe 
would 


If  the  student  is  interested  in  the  magnitude  of  the  answer  or. 
if  the  problem  involves  - mor?  than  one  arithmetic  expression,  the 
PIL  processor  is  made  available  when  the  student  types  "calc".  The 
following  is  part  of  a  lesson  drilling  percent  cqmposition: 

The  computer  dialog  is  in  upper  case. 


WHICH  OF  Tj^  FOLLOWING  MINERALS  IS  RICHER  IN  COPPER? 

iMINER;^  *      MOLECULAR  WEIGHT 

-  (Cu)5Fe(S)l.  502  GRAMSAio'LE 

(eu)2S  '  .  159  GRAMS /MOLE 

(HINT:    DETpRJ^l'lNE  THE  PERCENT   Cu  IX  BOTH  MATERIALS) 


>calc 
,  READY 

>fype  5*65.5*100/502 
'  5*65.5*100/502;  =  65.21*701    '        ■     .  •  * 

>t,ype.2*65  5*100/159 
2*65.5*100/159  =  79-871*21 
>returr\' 

RESPOND  TO  LAST  QUESTION 
>Cu2S       ,  A  ■ 

CORRECT.    NOW,   .   .  . 

Th6  response  "calc"  brings  PIL  to  the  foreground.  '  PIL  types 
READY**,  and  the  student  can  use  the  desk  calculator- mode  as  is 
done  here,  load  a  prewritten  program  or  write  and  execute  a  PIL 
program.    When  he  has  the  answer  he  reactivates  cAtALYST  by  typing 
return".    CATALYST  then  asks  for  his  response  to'the  last,  question 
and  ooatinues. 
*  .  > 

PIL  arithmetic  functions  are  very  close  to  English.    For  ex- 
'  ampl"e. 


WHAT  IS  THE  (APPROXIMATE)  pH  OF  A  0. lU?  MOLAR  ACETIC  ACID  SOLUTION?* 

(Ka  =  1.8E-5) 

>calc 

READY: 

>type  -  the  log  of  the  square  root  of  (O.  llfTn.Se-^) 

-  the  log  of  the  square  root  of  (0. 1^7*1^. 8e-5)  =  2.78&705 

>return  .  '  ^ 

RESPOND  TO  LAST  QUESTION 

>2.79  .  ''^  ^ 

VERY  GOOD.     NOW,  ... 

Another  useful  arithmetic  feature  of  PIL  for  CAI  applications 
is  the  pseudo  ri^ndom  number  genei:at©r.    At  any  point  the  lesson 
aesigner  can  acquiri?*'a  random  number  uniformly  distributed  between  ' 
zero  atvi  one.    To  distribute  the  number  betyeen  arbitrary  limits  a 
and^b,  the  simple  algorithm 


\^  =  .a  .  (b-a)  r^^ 


is  used.    For  example  if  a  rah^iom  temperature  between  273''  K  and 
575"*  K  were  desired,  the  expression 

T  =  27>  +  100  *  r^. 

01 

would  be  coded  in  the  CAI  lesson.    Then  if  the  random  nuiAber  generator 
returned  a  value' of  z^w,  T  would  be  assigned  the  value  273/    If  r^ 
is  one,  T  becomes  375- 


Stoich: 


Stoich: 


We  have  utilized  thes^  arithmetic  features  of  the  CATALYST/PIL 
interface  in  designing  the  CAI  molecules*    Most  of  the  lessons  drill 
numerical  problems  in  the  areas  of  stoichiometry  and  aqueous  equili-r 
bria.    These  problems  frequently  include  PIL-generated  random  numbers. 
Thus  no  two  students  w^ll  receive  the  same  numerical  piroblem.  The 
format  of  the  problems  are  fi*ed,'but  the  valuee  of  the  parameters 
(e.g^  temperature,  mass^  concentration^  etc!)  vary  randomly. 

The  sixteen  CAI  lessons  are  described  briefly  ^elow. 

Description 

Seven  questions  reviewing  molecular  weights,  gram- 
mole  and  moleclile-mole  conversions,  empirical  form'-, 
ulas  and  percent  composition  problems- 

Four  problems  reviewing  molecular  weights,  gram- 
mole,  molecule-mole,  and  atom-mole  conversions; 

Four  problems  dealing  with  percent  composition* of 
metal  chlorides,  molecular  weight  of  an  element 
from  its  isotopM.c  mass  and  abundanceV^and.  the 
empi!rical  formula  of  a  compound.      *  '       *'  . 

Five  problems  on  the  relationship  between,  moles 
and  gram-atoms,  use  of  Avogadro's  number, \nd 
weight^weighL  stoichiometry.  ^    •     \  ^ 

Simulation  of  stoichiometry  experiment  to  find 
the  empirical  formula- of '  a  "copper  and  sulfur 
compouijd;  consisting  of  two  parts,  ah  'experimental 
section  and  a  section  of  questions  relating  to  the 
properties  of  the  substances  used  ^and  experimental 
,  techniques.. 

Six  .problems, on  pressure,  volume  and  temperature 
relationships  of  ideal  gases. 

Four  problems  covering  gas  laws,  Graham's  law  o'f 
diffusion  and  Dalton^s  law  of  pafH^^l  pressures. 


Stoich:  k 


Expert :  1 


Gas X 
Gas :  2' 


Collig:    1  Three  problems  on  colligative  .properties  of  solu-  ' 

tions:    freezing  point  depression,  boiling  point 
elevation,  molal  solutions  and  molecular  weight 

•  determination. 

Bonding:    1        Twenty-one  questions  reviewing  periodic  properties 

/of  the  elements,  quantum  numbers,  magnetic  properties 
Vno  calculations  (periodic  table  is  necessary). 

Faraday:    1        Ten  problems  on  the  application  of  Faraday's  law: 

-^mole-faraday,  amp- coulomb ;* current-time  conversions 
and  the  amount  of  material  involved  in  oxidation-re- 
duction reactions. 

Aqeq;    1  Seven  questions  discussing  acids,  properties  of 

•  electrolytes,  dissociation  and  law  of  mass  action; 
'no  calculation^.  ,  ' 
Aqeq:    2  Four  problems  covering  hydrolysis*  reactions, 

dissociation  constants,  .mass  action  expressions,. 

Aqeq:    5     '  •      Four  problems  calculating  hydrogen  ion  concentra- 

.    tion,  pH,-  hydroxide  ion  concentration,  pOH,  and  the 
dissociation  constant  of  an  acid.  '  : 

I 

Aqeq:  '  S  Two  questions  and  two  pjroblemS  discussing  properties 

,    .  of  buffer  solutions,  cbncentrations  of  species  in 

solution,  and  diprotic  acids. 

Aqeq:  '6^       *    Three  problems  on  solubilitjr  calculations. 


Many  ^freshmefi  have  difficulty  with  chemistry  because  they  are'  un- 
willing to  master  the  required  chemical  arithmetic.   .This  CAI  faci- 
lity eliminates"^  the  drudgery  of  working  problems  beca^use  the  students 
use  the  computer  to  do  the  calculations.    We  believe  that  this 
teaching  aid  will  make  learning  chemistry  more  interesting  for 
average  and ^ below  average  students.  -  ^  ^ 

Other  Featured      ,  '  ' 

*        <■  *  ' 

These  lessons  .are  written  so.  that  at  any  "question,  if  the  stu- 
dent needs  more  information,  he  may  ask  for  *^help".  .The  following 
example  from  Aqeq:    6  illustrates^  this  fea-tuxe  along  with  the 
ijfindom  number  and  arithmetic  features. 


EXERCISE:    THE  IONIC  SOLID  MX2  DI^SOcfATES  AS  FOLLOWS: 


MX?<r— =>  M(++)  +  2X(-) 
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IF  THE  SOLUBILITY- PRODUCT,  KS^  IS  6>k>9E-l6, 
CALCULATE  THE  SOLUBILITY.  OF  MX2  IN  PURE  WATER. 

-  '  ■■       '        .       .     <•  -     , ' . 

>helP        .  .   .  . 

WANT  S(m  MLP  AT  THIS  POINT?  ..- 

LET  ^  EQUAL  '^HE  SOLUBILITY  OF  MX2  IN  MOLES /LITER.  '  . 
THEN  n- FOLLOWS  THAT  IN  THE  SATURATED  SOLOTION,  ,     '  ^  . 

(M(++))  i=  Z  MOLES /liter         '  •        .  -  . 

'  '     (X_(-))  >.2*Z  MOLES /liter  -  '  .  -  * 

AND  (M(++))Mx(-.)-)**2'=  KSP  ,       .  '  "    .  ; 

NOW  TRY  THE  PROBLEM  AGAIN.  "      *  •••  "  ■      '      •  •  . 

I  .  * 

>{6.409E-l6/ii)«<-(l/5).  '  '     ■  ■         '        .  \ 

RIGHT.     NOW,  .   .   .  '  /  .  ^ 

Here  the  value  6.ii09E-l6  (6.i*'t)9  X  lo""^"^)  i'S^a.rarldoin  numbed  tinlif'OijnIy  \ 
distributed  between  i|*670E-l6  and  i^.670E-10,.   Tlie  student  a&ks.fdr" 
help,  receives  a  hint  and  is  invited  to  try  again.    He  rep}?eSents 
the  answer  by  the  arithmetic  expression,  (6.^09E'l6/i+)*^(l/5),  i^hich 
is  equivalent  to  |-6.  ^09xiO"^Pjl/$  >  "/    \  '     \   '  ' 

The  student  is  given  two  or  th^ee  chances  to  respond  .6orrectly* 
If  he  fails*  to  get  the  right  ansVer  the  comptiter  supplies'  it.  ^Aiso, 
hints  are  sometimes  automatically  given  as-  £n  the  next  -example.* 


IF  YOU  REACT  l,25i^Ei01  GRAMS  OF  SILVER  AND  6.27.ilE+00  GRAMS  OF  SULFUR 
WHAT  WEIGHT  OF  SILVER  SULFIDE  CAN  BE  FORMED?  '       ,  *  ,  . 

(MOLECULAR  WEIGHT  OF  (ag)5S=2ll7. 9  G/MOLE)       '  ^  ^  ^'"^ 

>help  ,  ^ 

2  Ag  +  S  >  Ag25 

CALCULATE  THE  G-ATOMS  OF  SILVER  AND  SULFUR.  .DETERMINE  .WHICH  REAGENT 
IS  LIMITING  AND  WHICH  IS  IN  EXCESS. 

>o  '  ;  - 

CHECK  TO  SEE  IF  YOU  CALCULATED  THE  G-ATOMS  OF  EACH  ELEMENT  .  • 

CORRECTLY:  , 

1.25'^E+Ol  G     1  G-ATOM/lCr/.S?  G  =  I.I65E-OI  G-ATOMS  Ag 
6.27'^E+OO  G  *  1  G-ATOM/52.1  G  =  I.96OE-OI  G-ATOMS  S  , 

THEREFORE  SILVER  IS  THE  Lll^^ITINQ  REAGENT.  '  . 

■241 
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•  THE 'solution. -IS:  .    -  " 

WT.  OF  (Ag)2S+WT.  OF* Ag,  IN  GRAMS  *  MW  OF  (Ag)2S)    /    (2*AW  OF  Ag) 

=  l.^i+E+Ol  G  *  21+7,9  G/m5LE  *  1  MOLE/ ( 2*107. &7  G) 
•THE  ANSWER  IS  l.Ul+OE+Ol  GRAMS. 

.    YOU  RAVE  MADE    6  MISTAKES.        -  '  , 

YOU  HAVE.  NOT  COMPLETED. THE  ENTIRE  LESSON.  ' 
PLEASE  REVIEW  SOME  MQRg  AND  COf-IE  SEE  ME  AGAIN  SOON.'         _  v 

HAVE  A  GOOD  DAY.' 
M:END  0}^  JOB      '  • 

Other  features  including  "skip",  "b*£ick"  and/*eoinraent";  are 
"being  coded  into  the  CAI  lessons  for  more  versatility.    If  a  stu- 
dent wishes  to  skip  the  question  just  presented,  he  can  tyj)e  "skip" 
Or  and  he  is' branched  to  the  next  question,    The^back.  feature 

^  /  allows  a  studi^nt  to  go  back  question  by  question  if  he  wishes,  to 
review  a  previous  problem.'   He  accomplishes  this  by  typing  "back" 
or  "-1".  The  "coiument ".'feature  enables  a  student  to  type  his  opinion 
,oS'the  lessor),  05  any. sugge stolons  at  the  terminal  while'  thoughts 
arQ  still  fre^h  ia  his  mind. 

/    Also,  a  complete  management  system  Ijs  being  developed  to 
.  provide" -feedback  on  lesson         '  SpeJ^rSl  PIL  datasets  ,will  be. 
loaded  when,  execution  of  *  a  CATALYST/ iessonr;t6gins.    These  PIL 
datasets  will  record:/  whidh  ques-^ohs'  are,  answered  correctly  or 
incgryeclly/  how  oftel^a  help  or  hi.nt  is  us^ed,  acrd  -hoV  effective  it 
is  (i.e* Whether' the  qti^tlon  wasl- ^nsw^red  correctly  or  incorrectly 
'  'after  the  hint  is  ^presented-) ;  normal  b,M  abiap)rmal  termination  of  a  . 
'    lesson;  the  total  number  of  right,  and  wrong 'responses ;Hhe' fiumber  of 
lilies  each  question  was  .sklppe^  or- repeated;  and  the.  canments  and 
suggestions  typed 'by  the  student,  -  This  information  will  pifipoint  * - 
problem  *areas  in  the  lessons  "and  help  us  ct^eate  more,  effectrive^ 
teaching  modules,  '  '    '         '  ' 

.   Evaluatibn'   ■  *      .    '  *     *  y  .  '     *'    '  , 

*  *       So  f^r  student  feedback  hag  been  encouraging.    The ^students 
enjoyed  using\the  lessors/ and  a're  asking  for  more  programs*  Student 
'^piniori  obtained  fr^  an  attitude  questionnaire  inclicates  tlxa;)^  the 
time  required *>o.  use  the  programs  was  well  spent  and  that  CAi  1-6*  an 

«  effective  teachi^vg  tool..   Siudejrjts  ^Iso  recommended  contirji^ed  develop 
ment^  of'the' program  in 'freshmen  ch:emi5try  an^  that  ^  they  would. ; 
participate  In  .similar  programs  In  otl^r^ourse$  if  given  the  oppor- 
tunity,  ^Th^  following  stUdeht  comments  are^tak^a  froifi  the  CAI ^ 
questionnalcp':     •   ,  ^  *     -  ^      ,  .  ^ 

'     ■  '   r       ■■  M 

•      *  .       I.  -     .  .  •  ' 

•    '         ^    ^  ( 
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"I  found  working  with  the  computer  very  interesting  and  enjoyable. 
Because  of  this  experience.,..,  I 'am  thinking  of  taking  some  courses 
in  computer  programming. " 

"The  idea  of  computer-assisted  [instruction]  programs  is  great..." 

"Once  you've  read  the  material,  and  understand  it  (outside  of  the 
computer  room),  I  feel  it  can  be  an  invaluable  study  aid...  Thank 
you  for  the  opportunity. " 

"it  gives  me  more  confidence  in  my  work  and  mptivates  one  more.  I 
find  that  I*m  enjoying  chemistry  a  lot  more  because  of  it." 

"I  sihcerely  hope  tj:iat  the  computer  program  will  be  continued  and 
expanded  for  the  rest  of  Chem.  11  and  onto  Chem.  12  material. " 

"...I  am  really  glad  that ...  CATALYST  was  offered  to  me  and  I  hope 
that  mor^'  CATALYST  will  be  offered  for  the  rest  of  Chem.  11  and 
Chem.  12." 

"J  have  als'o'found  Jhis  method  very  effective  in  my  biology  course." 

"The  computer  is  very  wittyJ" 

I^edictably,  the  students  most  enthusiastic  aboxit  CAT  are  the 
A  and  B  students.-   Yet  the  emphasis  is  on  tutorial-drill  lessons  for 
average  and  below  average  students.    Most  of  the  students  in  this 
lattei:  group  "can  never  find  the  time"  to  locate  a  terminal  and  try 
out  the  programs.    It  is  hoped  that' with  increased  availability  of 
terminals  and  with  the 'development  <>f  a  Aore  complete  and  effective 
library  of^AI  prograitis,  this  motivational  problem  will  be  solved. 
We  plan  to  continue 'the  CAI-proJect  at  Pitt.    We  think  that  with 
care  we  can  provide  our  freshman  cheftistry  students  with  a  learning 
aid  that  will  make  the  study  of  chemistry  more  interesting  and 
exciting/  And  we  think  we  can  provide  the  instructer  with  a  tool 
that  -will  make  his  teaching  mor'e  effective. 


*  Direct'  Classroom  Use  of^^a  Time-Sharing  Computer 
in  Chemical  Education 

Philip  Ec  Stevenson 
Department  of  Chemistry 
Worcester  Polytechnic  Institute 

•1.     General  Considerations 

• 

There  are  many  topics  in*  the  undergraduate  chemistry  curriculum 
where  complicated  mathematical  formulas  are  necessary  for  quantitative 
^descriptions  of  phenomena.     These  topics  range  from  ionic  equilibrium 
in  freshman  chemistry  to  advanced  areas  in  quantum  mechanics  and  sta- 
tistical mechanics.     While  teaching  these  topics,  the  instructor 
equipped  only  with  the  conventional  blackboard  and  chalk  simply  cannot 
demonstrate  quantitative  application  of  these  formulas,  except  perhaps 
by  reference  to  previously  calculated  quantities,  or  by  the  use'  of 
hand-waving  approximations   (a^  in  ionic  equilibria) ,  or  by  the  use  of 
very  simple  idealized  cases   (such  as  the  Ideal  Gas  Law) . 

However,  with  the  use  of  special  x::omputer  programs,  previously 

Q 

written,  debugged,  and  loaded  into  a  time  sharing  computer  system  (such 
as  the  RCA  Spectra  70/46  at  WPI)  ,  an  instructor  wit^a  remote  time 
sharing  terminal  at  his  side  can  demonstrate  jthese  quantitative  calcu- 
lations .to  his  classes.     He  need  only  type  in  a  line  or  two  of  input  and 
wait  a  moment  for  the  results.     The  student  question,   "What,  happens  xf 
we  change  the  value  of      .  to  is  easily  answered. 

In  this  paper  I  will  discuss  a  number  of  such  programs  that  1  have 
written,  some  with  the  assistance  of  WPI  undergraduates  Completed 
programs  include /'DIATH2 , "  for  computation  of  the  binding  energies  of 
H2  and  H^'^ ,   "SHMO"  which  performs  Hiickel  MO  calculations,   "ACID"  and  ^  ^ 
"BASp"  for  simulation  of  weak  acid-strong  base,  and  weak  base-strong 
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acid  titrations,  respectively,   "GASLAW"  for  calculation  gf  pressures  by. 
van  der  Waals '    law,  and  **FEF"  for  calculation  of 'free  energies,  of  re- 
action  fori  arbitrary  temperatures.     Programs  in  development  or  Sipntem- 
plated  include  "SALT"  for  simulation  of  precipitation  titrations  and  ' 
programs  in  the  areas  of  reaction  kinetics  and  statistical  mechanics. 
First,  however,  I  would  like  to  discuss  some  of  the  general  aspects  of 
writing  such  programs.  ^  c 

To  begin,  one  must  choose  a  computer  language.     At  WPI ,  this 
choice  is  among  BASIC,  FORTRAN  TV,   and  TFOR  (time-sharing  fortran) . 
Each  language  has  its  own  advantages  and  disadvantages^    FORTRAN  IV  is 
computationally  the  most  ^werful  of  the  three.     This  computation  power 
was  necessary  for  "DIATH2"  which  required  double  precision  arithmetic 
for  integral  evaluation  and  for  "SHMO"  .which  utilizes  "EIGEN"  from  the 
IBM  Scientific  Subroutine  Package  for  matrix  diagonalization .  "ACID" 
,and  "BASE"  are  ih  FORTRAN  IV  because  they  were  developed ^ from  a  prer 
vious  batch  program  written  'in  WATFOR.     In  addition/  source  program:?.  '  t 
FORTRAN  IV  alP  normallij  in  card  form  and  can  be  s^^fely  stored  and  easily 
be  edited,  dupli(?ated  and'  distributed.    FORTRAN  IV  has  three  serious 
disadvantages,  however*'    Any  change  in  a  program  requires  a  batch  run, 
so  one  must  wait  for  whatever "  turn  aro^iVid  time  one's  computer  ^.center 
imposes,  in  order, to  check ^bhe  results  of  the  changes      Input  format 
requirements  are  fairly  jMgid.     For  instance,  one  tnust  right- justify 
all  exponential  and  int:eger  in'putc     Fa^nally  FORTRAN  IV  load  modules  in 
the  RCA  system  aVe  Lafge  and  non-sharable  and  sometimes  generate  paging 
problems.    "DIATH2"for  instance  has  a  load  module  that  is  close  to  LOOK 
bytes  long.     Four  users  executing  "DI^TH?'  S3.multan$ously  wov^ld  be  uti- 
lizing about  40*  of  the  available  drum  pages,  and  these  pagers  would  be 
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swapping  in  and  out  of  core  'often  enough  to  decrease  significantly  the 
CPU  efficiency.     TFOR  and  BASIC  on  the' other  hand  are  not  as  computa- 
tionally powerful  as  FORTRAN  IV,  but  are  interactive,  so  that  program- 
iDin5  errors  can  be  inmvediately  detected  and  remedied,  "have  relatively 
few  input  restrictions,  and  use  much  less  core  and  druiti  space  for 
object  coding*  /  )  , 

Having  chosen  a  language,  one  tnust  next  obtain  a  properly  execut- 

J 

ing  algorithm.     Qftea  one  already  has  this  in  a* batch  program  or  a  pro- 
gram obtained  from  elsewhere*     Then  one  must  deyise  an  input-output 
scheme  that  is  suitable  for  interactive  execution  of  the  program  and 
suitable  for  the  kind  of  terminal  which  one  will  be  using.     My  personal 
preference  is  for  teletype  because  it  is  the  cheapest  kind  of  terminal 
and  because  it  produces  hard  copy.     Its  disadvantages  are  noisy  opera- 
tion and  slow  output  (10  characters  per  second).     In. any  event,  one 
wishes  to4program  for  minimuift  ir\put,  and  relatively  small  amounts  of 

]  ' 

output.     One  should  keep  in  mind  .that  the  interactive  nature  of  such 
program^  allows  one  to  use  output  from  an  fearlier  part  of  the  program 
to  influence  input  tp  a  later  part  of  the  program.     If  one  has  written 
his  program  in  FORTRAN  IV,  one  must  provide  some  way  of  aligning  ihput 
to  the  expe^cted  i-nput  fojpnat.     I  do  this  by  output  ting  strings,  of  char- 
acters just  ah6ad  of  input  lines  which  show  me  whe^re  to  place  ,the  input. 
Finally,  it^is  a  good  idea  to  have  the  program  type  out  instructions 
for  its  own  use. 

The  kind  of  problem  one  would  choose  for  this  sort  of  interactive 
program  ideally  would  use  a  very  small  number  of  parameters  as  input, 
do  lots  of  computation,  and^^nerate  a  modest  number  of  significant 
results.     I  feel  that  the  specific  programs  which  I  will  now  discuss 
more  or  less  meet  these  criteria. 


0^'  ¥V> 
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TT.     '*nTATH2"-A  Program  for  Demonstrating  Chemical  Bonding  in  Hydrogen 

H2  and         are  traditionally  the  first  molecules  encountered 
by  the  student  of  quantum  ^chemistry .     At  some  point  in  his  intro- 
duction to  the  subject,  he  sees  his  instructor  dutifully  derive 
formulas  for  the  energies  of  the  valence  bond* function  for  H2  il) 
and  the  molecular  orbital  functions  of  H2'*"  (2)  and  H2^   (30-     He  may 
even  encounter  the  Weinbaum  (£)   and  configuration  interaction  treat- 
ments.    However,  when  it  comes  to  numerical *re3ults ,  all  he  encounters 
are  tables  and  perhaps  a  dissociation  curve  or  two.     He  never  sees 
nor  participates  in  the  process  of  calculating  these  results.     If  for 
this  lack  of  participation  he  concludes  that  quantum  mechanics  is  a 
"black  art",  he  is  not  without  some  justification. 

However,  with  the  aid  of  a  computer  and  a  program,  •'DIATH2", 
described  here,  it  is  possible  for  the  quantum  chemistry  teacher  to 
add  numerical  results  to  his  classroom  discussion  of  H.  and  H 
Using  a  remote  terminal  connected  to  a  time-sharing  computer,  he  may 
enter  arbitrary  values  of  the  internuciear* distance,  R,  and  Is 
orbital  exponent,  4,  and  obtain  the  resulting  energies  within  seconds. 
He  could  enter  the  "book"  values  of  R  and  C,  or  with  somewhat  more 
expenditure  of,  time,  search  for  their  optimum  values.     He  cotild  assign 
the  construction  of  optimized  dissociation  curves  as  homeworko 

The  program  treats  the  various  wavef unctions  for  H2  and  H2'*" 
which  can  be  constructed  from  Is  orbitals  centered  on  each  nucleus, 
namely  the  valence  bond  function  of  H2 ,  the  molecular  orbital  func- 
tions for  1^2*"  and  H2  and  the  simplest  configuration  interaction 
function  for  H2 .     The  Weinbaum  treatment  is  omitted,  since  it  is,  ^ 


formally  identical  to  the  configuration  interaction  treatment.  The 
program/  wi;itten  entirely  in  FORTRAN  IV,  takes  the  input  values  of 
R  and 'C/  converting  R  ifito  atomic  units^  if  necessary^  ahd  evaluates^, 
all  the  necessary  integrals  involving  components  of  the  Hamiltonian 
operators  over  Is  orbitals   (Eq,    (1))  on  the  two  nxiclei,  . 

is=%'/WV"  (i)  ' 

The  program  then  evaluates  the  electronic,  total,  and  binding* 
energies  for  the  valence  bond  wave  function,  the  molecular  orbital 
wave  functions  for  H2"*"  and  H2,  and  the  configuration  interaction  wave 
function  •    At  the  users  option,  any  ar  a^l  of  the  output  may  be  "^ty^e^ 
at  the  terminal  for  immediate  inspecrion.     Any  output  not  typed  is 
printed  off  line  for  later  use.     An  auxiliary  subroutine  "PARAB" ,  is 
available  which  constructs  a. parabolic  curve  when  given  three  points. 
It  also  gives  the  extreme  point  of  the  curve.    This  is  useful  when 
one  is  jSptiTnizing  for  5  or  R  and  desires  a  quadratic  interpolation. 
The  use  of  the  input  distance  R  =  0,0  results  in  output  pf  the  elec- 
tronic energies  of  He"*"  and  He, 

I  have  used  this  program  in  my  Molecular  Orbital  Theory  course. 
Student  enthusiasm  was  high^  and  number  of  points  were  brought  out  and 
clarified  which  are  often  confusing  or  not  appreciated^     These  points 
included  the  incorrect  dissociation  of  the  molecular  orbital  function, 
the  united  atom  picture  for  electronic  energy,  and  the  role  of  orbital 
contractiono     Application  of  the  variation  theorem  was  rather  effec*- 
tively  demonstrated.    However,  in  the  course  of  an  eighty  minute  class 
period,  there  was  only  time  enough  for  a  thorough  discussion  of  ^2'^^^' 
Thus  in  future  years,  I  plan  to  assign  homework  problems  invo'lving 
student  use  of  the  program,  as  well  as  use  it  for  demonstration  purpose 
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III,     Ionic  Equilibria  Progrars 

Ionic  equilibrium  is  traditionally  one  6f  the  most  difficult 
topics  in  the  freshman  chemistrj^  syllabus.     The  combination  of  complex 
relationships  among  the  variables,  the  need  for  such  complicated 
mathematical  techniques  as  the  method  of  successive  approximations, 
and  the  need  to  recognize  which  variables  ccin  be  neglected  in  which 
equations  proves  to  be  the  downfall  of  many  a  studentp  Demonstration 
calculations,  though,  pedagogical^y  necessary,  are  tedious  and  time 
consuming  for  an  instructor  to'perform  in'clasgo  . 

We  are  writing  a  series  of  demonstration  programs  which  simulate 
titrations  •     "ACID"  computes  a  pH  v^l  volume  of  base  added^^curve  for 
the  titration  of  a  monobasic,  dibasic,  or  tribasic  weak  acid  vi^h  a 
strong  baseo     "BASE"  is  the  analogous  program  for  a  rnondiicidic,  dia- 
cidic,  or^triacidic  weak  baseo     Both  "ACID"  and  "BASE"  can  also  give 
concentrations  of  all  substances  at  any  point  in  the  titrationo.  "SALT", 
which  is  still  in  development,  will  simulate  precipitation  " titrations 
in  which  the  kinds  of  ions  produce  slightly  soluble  1:1,  2:ly  3:1,  or  ' 
3:2  precipitates o      ^  ^  ' 

One  might  better  appreciate  the  value  of  a  simulated  titration 
if  one  recalls  that  titrations  unify  the  problems  of  weak  acid  and 
base  pH's,  buffers,  hydrolysis,  salt  solubilities,  and  common  ion 
effects.     A  detailed  discussion  of  "ACID"  will  help  clarify  this* 

3 

One 'deals  with  the  equilibrium  of  the  weak  acid  HA  with  its  ions; 

HA(aq)       H+  +  A"  ^  (2) 

(H^].(A-]  ^        '  (3)  # 


(HA] 


13 


erJc 


the  self  dissociation  of  water; 

^  .'.       .         "         ■  -     '  -  / 

H2O  =  H"*"  +  OH"  •  (4/ 

iHt)  [0H-]  =•        -  1.0  X  10*^**  •  (5) 

the  conservation  of  the  initial  amount  of  kcid; 


(HA]  +«  [A-]   -         r  ^a  ■    -J  (6) 

,         '      •        '    ^a-^;^b     ■  • 

and  the  conservation .of  electric  charge; 
«.  •  ,  • 

(H+]  +,  tNa+]  ■»   (OH-]  +   [A-]  (7) 

The  fJa"*"  ion  concentration  is  known  to  be: 

"  (Na+]   -  F^^[  )  '     .  "        (8)  > 

• .  .  at)  » ' 

.Eqs.   (6)^(7),  and  (8)  assAJUn^  »that  a  g^iven  point  in  the  titration 

is  attained  by  mixing       ml  of  weak  acid  , of  .formality       with  Vj^  ml 

of  NaOH  of  formality  F,  o     The  progr^,  j.n  fact,  assvunes        «  50o0  ml 

D  *  ^        ,  a 

*   and  accepts  F  ,  F,    and  -K    as  input  i)arameters,    V,    is  the  .independent 
>a      o         •  a  i  o 

variable  in  the  titration  and  is  automatically  incremented  from  OoO  ml 
to  a  point  20%  past  the  equivalence  pointo     The  algebraic  solution  of 
Eqs.    (3),   (5),   (6),   (7),  and  (8)   is  programmedo     The  v.ari2ibles,  [OH*] 
and  [A*]   are  thu«"  eliminated  from  Eq,   (7)  and  the  resulting  cubic 
equation  in  ^H"*")  is  solvec^  by  the  Newton-Rciphson  methodo 

One  would,  of  course,  never  go  through  this  complete  solution 
in  a  har^d  calculation,  but  would  instead  neglect  certain  quantities 
in  Eqs,    (6)  ^ind  (7)   and  then  plug  the  equations  into  Eq.   (3)  ,  . 
resorting. to  the  method  of  successive  approximations  as  ^  lalt  resorto 
»  However,  most  freshmen  lack  the  chemical  intuition  to  be  able  to  know 
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what  quantities  to  neglect  and  when  to  neglect  themo     Since  "ACID"  . 
calculates  the  concentrations  of  .all  the  ions  in  solution  without       -  v 
neglecting  any  of  them,  a  freshman  chemistry  instructor  can'  use  it 
in  class  to  give  his  students  some  feeling  for  the  numbers  respiting 
from  ionic  equilibrium  calculations o    .Thus  students  could  begin,  to  * 
acquire  the  chemical  intuition  which  they  ne6d  in  .order  to  masjter 
the  subject  of  ionic  equilibrium* 

I  have  use4  "ACID"  experimentally  in  n^y  two  recitation  sections  ■ 
•in  CH102  (freshman  cnemistry)  at  WPI.     An  ASR-33  teletype  was  set  up 
at  the  side  of  a  30  seat  clcissroom  using  a  TV  camera  and  monitbr  Xo 
show i  the  results  to  the  class.     Communication  with  the  RCA  Spectr-a^  70/46 
compdter  .at  the ^Worcester  Area  College  Computation  Center  was  by  tele- 
phone**  I  spent '  approximately  ICt  minutes  at  the  beginning  of  class 
discussing  Eqso   (2)   through  ^(8).     Then  I  obtained  the  titration  curve 
of  50.0  ml  of  OolF  acetic  acid  with  .OolF  NaOH  and  drew  attention  to 
the  equivalence  point  pH  of  8o72a     The  specific  problems  of  the  weak 
acid  solution,  the  1:1  buffei  and  the  hydrolysis  of  the  salt  of  a 
weak  acid  problem  were  then  demonstrated  by  requesting  computation 
of  the  concentrations  of  all  species  in  solution  at  OoO  ml,  25.-0  ml/ 
and  50.0  ml  respectively  in  the  titratione     For'' these  problems,  all 
the  standard  approximations  are  valid  and  the  program  convinced  the 
students  that  the  concentrations  of  neglected  specijes  are  indeed 
at  least  several  orders  of  magnitude  smaller  than  those  of  the  major 
species,     There^ remained  time  for  only  one  additional  titratione     In  ^ 
one  claps,  titration  of  a  moderately  strong  weak  acid  (K    =  5  x  ^O""^) 


was  demonstrated;     in  the  other  the  titration  of  a  very  very  weak  ^ 


acid  (K_  =-loO  X  10*"'  )  was  demonstrated^     In  these  two  cases  at 
a  i 
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least  one  approximation  breaks  down  in  each  of  the  standard  problems. 
For  example,  in  the. hydrolysis  of  the  salt  of  the  very  very  weak  acid, 
the  self-ionization  of  water  (Eqs.   (3)  and  (4))  is  significants  These 
breakdowns  wel^e  clearly  demonstratedo. 

Student  response  to  the  program  is  difficult  to  gauge  at  the 
time  of  this  writing  {the  third  day  following  the  demonstration).  The 
-students  ask^d  very  few  questions  and  there  was  little  spontaneous 
response  in  class.    After  cl^ss,  however,  one  student  asked  me  how  to 
execute  the  program  from  th^  student  teletypes;    another  complained  of 
poor  legibility  of  the  nurcihers  on  the  Screen  (a  minor  change 'in  format 
will  alleviate  that  problem) ;     and  d  third  said  the  demonstration  was 
fascinating.    The  real  test  will' be  whether  t>ie  demonstration  enables 
the  students  to  show  an  improyed  performance  0n  their  d^and^ations« 
My  feeling  is  that  thiS'Will  Fiappeno-        — , 


IV.    other  Demons tratx^pa  Pyograms     .  *     .  .       .  % 

'  '  i      >  ♦ '        •  •  * 

a.  Ouan1:uin  •Gh>ndsfcry  *      .         ,      ^   ^-^  ^  ^    .  . 

I  have  written  an  interactive 'program;  *'SHMOV  vhich.  performs 

simple  Hutkel  i^o-reculdr  Oxbit^V^alGuiations .     Input  consists  of  the  * 

matrix  dimension,  number  of  ^  electrons  and*  the^  Hamiltoniah  itiajt:r:i;x*'ele-^ 

ments  i-n  units  of  6,     t-eletype  output  consists  of  eigenvalues,  chargeV  ' 

densities,  and  hbncJ  or<^^erg  for  bonded  atom  pairs.   /In  addition  /  a  com- 

•  •  •  *  <  * 

plete  ou1;put  list  is  generatej^  off  line..   The  program  makes  use' 'of  the 
SSP  matrix  diagonalizat^^on  subroutine  "EIGBN.'^  In  addition^  my  •research 
level  program,  "NEMO"   (for  Noh  Empirical  Molecular:  Ortitals   (5))   is  set* 
up  for  interactive,  teletype  execution,-    Here  too,  a  selegCion  of  *  output  , 
is  typed  on  teletype,  ajid  the  complete  output'  Tist^ is  generate;^!!  of f 
line.     I  make  use  of  these  two  programs 'and  "DIATIJ2-*  in  my  quantum- 
theoi^^  co'urse. 

b.  statistical  Mechanics  '  ^ 

I  havd  recently  ass.igned  a*  freshman  hon9r9- lab  studeilt '-to  pro-^ 
gram  the  following  pxolplem:     Given  a  fished  number  of  icientical  -^^ndi- 
stinguishable  partlples  possessi-ng  a  f>^ed  total  energy,  ctnd  free  tp 
move  in, a  spectrum  of  ^  evenly  -spaged  one  particle  energy  levels  compete, 
the  probability  distributipn  of  ^ particles  in  these  levels.     This  pro,- ^ 
gram  is  intended  t6.  generate  a  feci t scmann  diist^ibut ion ^.by  ,simf)le  pount- 
ing  of  particles  and  will  compute  dist;t;ibutions  for  large  values' of  N  • 
and  E  following  a  hand  calculated  deiootis tratiop  for  s'mall  nuntoeir^*'s*Qch" 
•as  M  =  3  paxticles*  and  E      6  units  of  energy.     The  Energy  spectrum  is  ' 
,  assumed  to  consist  of  evenly  spaced  levels^ with  separations  of  ofie  unit 
starting  at         t)      '  ^         '  *  '  '  ' 

c.  Thermodynamics  '  -     '    *  •  " 
One  of  my  students  has  just  'completed  a  program,  ''GAgi^AW/' 

which  computes  the  Ideal  and  van  der  Waais'  pressures .  af' gases ,  given* 


T,  V,  and        as  input.     A  table  of  pararoeters .  a  and  b  for  common  sub- 
stances is  stored  in  the  progrsim,  but  the  us^er  can  supply  his  own 
values  if  he  wishes/    T|iis  program  can  be  used  to  p;:ovide  ,a  quantita- 
tive baais  for  discussions  of  devi'ations  from  the  Ideal  Gas  Law. 

She  has  hlso  nearly  completed  a  program,  "PEF^"  which  tests  the 
-validity  of  the  approximation 

AH*  +  Ta'S*  .    •  •    ' '    -  (9) 

where  AG  is  the  free  energy  of  a  chemipal  reaction  at  an  arbitrary  tem- 
peifa^ure  T  and  AH°  and  AS°  are  the  .standard"  (T  =  298°K)  changes  in 
e^nthalpy.  and  entropy  .respectively.     The  test  is  through  comparison  with 
the  'itfore  .accurate 

,  .-V  *     AG-=  AH(T)  •+  TAS(T)  "  •  '  (10)  - 

whej:§^  heat  .-pdpacity  data  are  used  to  evaluate  ^!:H(T)  and  AS(T}  explicitly 

for  the  given  temperature  T.    A  table  of  heat  capacity  coefficients  is 

.'-^  '  '  .  * 

'stored  itj  the  program.  •  •  .       _  .  ^ 

^  "  '     Another  student -is  working  on  a  program  incorporating  van  der 
Waals^   law  and  the  criterion  of  inherent  st^ility  to  generate  P  vs.  V 

•isotherlns  to  .derponstrate  the  phenpmenon  of  condensation  of  a  gas  to  the. 

liquid  state.         *  •        v  . 

*  M.   '  Chemical  tcinatics  • 

-  ...Afiother  .student  is  starting  a  project  to  simulate  the  time 

develQpment  of  chemical  reactions  for  a  variety  of  reaction  mechanisms. 

He  ^hai-^. Already  written  a  program  for  the  one-step; •  first-order ,  irre- 

'versible  reaction  •  '  * 

'    ,^  r-^^  B  _  ;  (11), 

and  is  working  on  the  conaecutiye  two  step  r^actit>n' 

A  — 5-.>  B  — ^>  a  ,  (12) 
where  both  .reactions  are  fifrst  order  and  irreversible. 
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V.  Program  Distribution 

"X)IATH2"  is  currently  availcU^le  through  the  Quantum  Chejnis.try 
Program  Exchange,  Chemistry  Department,  Room  204,  Indiana  University, 
Bloondngton ,  Indiana  47401.    As  the  remainder  of  programs  becwne  ready 
for  distribution,  they  will  also  be  available  from  QCPE. 

VI.  Acknowledgments. 

To  Dr.  Norman  E.  Sondak  and  the  Worcester  Area  Co.llege  Computa- 
'tion  Center  for  furnishing  computer  time  j^d  advice  on  specific  pro- 
gramming  problems. 

1^  the  following  WPI  undergraduates  for  assisting  on  this  project: 
Da'^id  Lyons,  Janet  Merrill,  Robert  Schlain,  Benjamin  Thompson,  and 
cnristopher  WilliamSc  .  ' 

Footnotes ;  .  ^ 

1.  These  formulas  can  be  found  in  any  stand^ti:^  introductory 
(Quantum  chemistry  text.     For  exafl^le:  '  L.  Pauling  and  E.  B. 
Wilson,  Jr.,  "Introduction  to  Quahtum  Mechanics , "  McGraw- 
Hill  BooH  Co.,  New  York,  1935,     pp.'  326^353.  /  ^ 

*^  •  -  '  ^ 

2.  The  specific  foxmaiad 'are  not  repeated- here  since  they  are 
readily  available •  '  .  ' 

3.  This  discussion  is  limited  to  the  case  of  the  m6noba^ic  weak  . 
acid  HA,     However  the  program  algorithm  is  capable  of  hand- 
ling H^A  and  H3A,  as  well.  ^  '    '  ,^  *\,.-- 
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Bell  Labpjatories 

For  years  Theoretical  Chemistry  has  existed  as  two-  distinct 
disciplines;  Quantum  Chemistry  which  has  been  mainly  concerned 
with  the  electronic  structure  of  isolated  molecules  in  their 
ground  and  excited  states,  and  Statistical  Mechanics  which  has^ 
developed  sophisticated  ways  to  deal  with  th6  macroscopic 
properties  of  ensembles  of  weekly  interacting  molecules.  The 
members  o^  these  disciplines  have  had  rather  little  to  say  to 
one  another,  even  though  the  molecular  i^iteraction  potentials 
required  for  the, theories  of  statistical  me<:hanics  were  almost 
sure  to  be  generated  by  the  Quantum  Chemists . 

The  Theoretical  Chemistry  session  exhibits  the  first  stages 
of  -a  joining  of  these  disciplines  which  will  initiate  a  fruitful 
new  age  of  Theoretically  Chemistry.    The  member  role  of  the 
computer  in  making  this  synergistic  match  will  be  appiirent  from 
these  t^Iks. 


COMPUTERS  AND  THE  SCHRODINGER  EQUATION  AS  A  RESEARCH 
INSTRUMENT  AND  A  TEACHING  TOOL  . 

Arnold  C.  Wahl,  Chemistry  Division 
Argonne  National  Laboratory,  Argonne,  Illinois,  6043.9 

^    .  Introduction 

■  There  exists  a  wide  spectrum  of  computer  appli-'^ 
cations  in  chemistry  research  and  teaching  as  evidenced 
by  this  cpnference.    These  applications  include  sophis- 
ticated interactive  automation  of  and,  data  processing, 
for  experiment&1^2^  the  exploration  of  organic 
synthesis3,  semiempirical^  and  a  priori^  theoretical 
calculations,  in  addition  to  a  wide  variety  of  simula- 
tion studies  in  chemistry  and  chemical  engineering^. 
In  many  ways  these  computing  systems  'can  be  used  as  or 
a&  part  of  .a  research  or  engineering  instrument  or  as  a' 
^teaching  tool.    One  of  the  most  intriguing  and 
prophetic  applications  of  the  computer  which  has 
developed  dramatically  over  the.  pkst  10  years  is  iri  the 
a  priori  modeling  of  atomic  and  molecular- s true tu re ^ ^nd 
TundamentAl  chemical  processes.    -The  reason  that  this 
area  is  ^exciting  is  j:hat  It  provides  a  means  for,  ob- 
taining chemical  information  independent*  from  experi- 
ment.   Further,  if  reliable  a.tomic  and  molepular 
characteristics  can  be  obtained  from  such  a  priori 
theory  they  then  can  t)e  fed  into  the  'lAathemallcal 
machinery  of  thermodynamics,  molecuJLar  dynamics  'and  . 
statistical  mechanics  to  yield  the  bulk  properties  of 
matter,  again  independent  frcs^n  experiment.    We  must, 
of  course,  be  very  cautious  in  making .^such  claims  and 
our  ability  to  make  them  at  present  re'^sts  ultimately 
op  our  facility  in  approximately^  solving  the  Schrodinger 
equation  for  groups  of  atoms  and!  molecules..    In  the 
next  section  we- will  brief ly  discjjss  this  capability 
and  then  illustrate  how  it  cknvbe,  conveniently  ^used  in 
research  and  teaching.     ,      ~  >  . 
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Various  a  priori  Methods  for  Approximately  Solving  the 
Schrodinger  Equation  for  Atoms  and  Molecules 

A  wide  variety  of  methods  have  bee'n  .developed 
both  formally  and  tomputationally  for  obtaining  molecu-' 
lar  wavefunctions  and  properties?^   There  e,xist  both 
variational^  ,  semi-variational    and  non- variational, 
schemes.     In  accessing  the  utility  of  'tliese  methods  it 
, becomes  clear  that  they  are  in  general ^fairly  costly 
and  piust  be  pushed  close  to  their  "engineering"  limits 
in  order  to  reliably  obtain  intei:ac'tion  and  excitation* 
energies .within  ±  .lev  and  other  properties  within  5^. 
Often  this  or  higher  accuracy  is  needed  if  even  quali- 
tative contact  is  to  be  made  with  modern  experiments. 
It  is  also  clear  that  except  in  special  casefe,  of  whiclL. 
there  are- many  examples,  the  Hartree  Fock  mcSdel'  cannot^ 
yield  chemically  ac.cuiiate  potential  curves  $ind  surfaces 
due  to  its  two  .shoi'tccmings.     1)  The  constrained  form 
of  the  molecular  orbital  picture  which  forces  dis- 
sociation of  the  system  to  improper  asymptotic  frag- 
ments and,  2)  the  increase  in  correlation  energy  • 
associated  with  molecular  formation.- 

Although  waveftinctions  of  neai'ly  Hartree  Fock^^ 
quality    have  been  computed -for  surfaces  and  large 
molecules  Tor  some  time  ,  it  ha^s  only  been  recently  " 
that  methods  beyond  the -Hartree  Fock  level  have  yielded 
useful  precijaion  (except  foy  Hj)  in  potential  curves 
and  surfaces-r^  Another  recent  advance  which  opens  many 
pirevious'ly  ift'achiev^ble  areas  of  chemical  interest  to. 
a  priori,  techniques  is,  the  apparent  ab^ility  tp  evalu- 
ate Van  der  Waals  interaction  with*  a  precision  of  about 
lOfo  and  possibly  better^.^^ 

»  In*  summary -a  combination  of  Hartree  Fock, 
'Mwlticonfigur9.tlqn.  Hartree-Fock,  NaJ;ural  orbital, 
St^raight  configuration  , interaction,  calculations,  and 
pair  theory  when  appropriately  used  shov/  promise  of- 
producing :in  a  systematic  way  chemical  bond  strengths, 
.long  range  interactions,  energy  surfaces,  term  values 
and  majiy  molebular  properties  at  a  level  of  ' accuracy 
useful,  to  the  clr^eTtiist.    'Furthe'r ,Qonservative.  estimates 
o-f -future  advahcesin  theory,  techniqui?  and  computer'  * 
technology  predict  that  such  calculations .will  become 
an  everyday  tool  of  the  chemist.  ^  In  the  next  section 
I  will  present  some  .t^pi-ca-l  ex:araples  of  such  .  . 
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calculations  and  theri  discuss  how  this  relatively  new 

quantitative  facilitfy  can  be  interfaced  utilizing  some 

ooncepte  of  computer  interviewing  and* artif ical 

intelligence  with  the  hon  specialist  and  perhaps  with 

the-  student. 
< 

Some  Examples 

All  of  the  above  methods,  when  pj:'operly  imple- 
mented and  thoroughly  applied  are  capable  of  yielding 
cheriiically  accurate  values  of  many  atomic  and  molecular 
properties  as  well  as  a  good  description  of  the  inter- 
action "Energy  between  atoms  and  molecules. 

I  would  now  like  to  briefly  review  the  method  and 
techniques'^  (Optimized  Valence  Configurations)  which, 
we  have  been  developing  in  our  lab'oratory  and  illustrate 
their  application  to  three  areas:     1)  Chemical  bonding, 
2)  Van'der  Waal  interactions,  3)  Chemical  reactions. 

The  essence  of  our  scheme  named,  "Optimized 
Valence  Configurations"  is  to  take  into  account  only 
those  changes  taking  place  as  the  molecule  forms. 25 
This  goal  is  achieved  by  1)  ^including  those  configura- 
tions' necessary  for  proper  dissociation  of  the  mole- 
cule to  Hartree  Fock  atoms,  2)  including  all  configur- 
ations which  exist  in  the  molecule  but  vanish  formally 
in  the ' dissociated  atoms,  3)  including  configurations 
which  describe    mainly  atomic  correlation  but  whose 
sContribution  nhanges  within  a  numerically  defined  . 
threshold  of  significance  as  the  molecule  forms.  An- 
important  fea,ture  of  Qur  method  is  that  We  use  the 
multiconfigurational  SCF  (MCSCF)  technique    to  form 
all  orbitals  occurring  in  all  configurations.     This  has 
the, very  important  feature  of  reducing  the  number  of 
necessary  configurations  drasticall;^,  since  each  con- 
figuration is  made  as  effective  as  possible.     The  first 
two  classes  of  configurations  are  unambiguous,  rela- 
tively few  in  number,  can  be . identified  by  purely 
formal  considerations,  and  yield  a  potential  curve-  • 
which  is  accurate  to  within  ±  \3  eV.     The  third  class 
must  be  selected  .by  actual  computation  and  'are  re- 
quired for  more  precision  (±  .1  eV).. 
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'     Ghemlcal  Bonding. 

The  pVC  scheme  has  been  applied  to  a  variety  of, 
moIecules"'and  borid  types  and  is  yielding  adequate  pre- 
*cision  in  describing  molecular 'potential  curves. 2d, 35 

When  all'  the  important  .correlation-effects  within 
and  between  various  molecular  s'hells*  are .  investigated,, 
it  is  found  that  consistent 'wit^  the  philosophy  of  the 
OVC  method  these  effeo/Cs  do  indeed  fall  ir^to  two  fairly 
distinct  categories -.X^ the  molecular  correlation,  which 
vanishes  when  the  molecule  dissociates  into  the  con- 
stituent 'atoms,  and  the*  atopaic  correlation,  which 
passes  asymptotically  to  ,that  for  the^atoms.     The  very 
weak  interdependence  or  "coupling"  of  the  two  corre- 
lation"* types  r-esults  into  considerable  computational 
simplification  by  allowing  one  in  \the  spirit  of  pair 
energy  to  perform  MCSC-F  calculations-  on  'groups  con- 
sisting of  a  comparatively  small  number  of  confiffura-  ^ 
tions  and  to  sum  their  respective  contributions  Xo  the 
bouuing  interaction.    The  calculations  on  F2  yield  a 
.potential  curve  which  is  very  similar  to  the  one  de- 
rived from  experiment.     Our  calctilatedSo  values  of  De, 
De,  Re  are  1.6?,  9^-4,  2.66  compared  with  the  Experi- 
mental ones  of  1.68,  931,  2.68.  '  . 

Van  der  Waals  Forces 

Until  recently  little  progress  had  been'made^  on 
the  evaluation  of  Van  der  Waals  forces  by  a  priori 
variational  techniques.     Several  recent  caTcula tions, 
however,  suggest  that  the  following  procedure  may  be 
adequate  for  obtaining  weA  depths  and  position  within, 
ten  percent33,37.     1)  Obtain  a  high  precision  Hartree- 
Fock  interaction  potential;    2)  evaluate  the  intra- 
atom  correlation  using  only  the  doiirinant  atomic  cor- 
relating  MCSCF  orbitals  as  a  function  of  the  inter- 
nuclear  separation;    3)  evaluate  the  contributions  from 
the  dispersion  terms.     In  step  (2)  it  is  necessary  to 
include  the  terms  that  represent ' enhanced  correlation 
at  a  particular  atom  owing  to  any  significant  "transfer 
of  charge"  fi:om  the  other  center  in  the  Hartree-Fock 
approximation. 

Such  partitioning  can  result'  in  a  great  dea;i.pf 
problem  simplification.    Since  each  step  in  this 
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procedure  involves  oYily  a  f-ew  configurations  to  be 
optimized  by  the  MCSCF  process  as  opposed  to  a  much 
larger  number  If  all  affects  must  be  computed 
simultaneously.     Further  calculations  indicate  that 
even  higher  precision  may  be  obtainable  by  this 
prescription. 

Chemical  Reaction  Surfaces 

We  have  applied  the  techniques  found  successful 
for  evaluating  molecular  potential  curves  and  long 
range  forces  to  the  calculations  of  the  energy  surface 
for  the  endothermic  chemical  reaction^" 
Li  +  H2        LiH  +  H.     These  calculations  are  still  in 
progress,  however,  some  interesting  features  have 
already  emerged.     From  the  nature  of  the  energy 
surface  E(Rj^,Rt j^, @)  and  the  changing  form  of  the 
wa*vefunction  it  is  clear  that  reaction  only  takes 
place  if  the  Ho  bond  is  stretched  to  about  2.^  bohrs. 
^en  the  dynamics  involved  in  a  lithium  atom  colliding 
with  a  hydrogen  molecule  are  considered  the  necessity 
for  stretching  the  H2  bond  implies^  that  only  highly 
vibrationally  excited  H2  will  react  with  Li.  - 

Automation  of  a  prori  Techniques 


Now  that  a  priori  techniques  exist,  which  when 
carefully  applied  can  yield  reliable  and  independent 
information  about  simple  and  fundamental  chemical 
processes  (ie.  bonding,  excitation  and  ionization, 
collision,  chemical  reaction,  vibration),  it  is  impor- 
tant to  consider  how  such  mathematical  modeling*  can  be 
made  readily  accessible  to  the  non  theoretical  chemi- 
cal researcher  and  perhaps  more  importantly  to  the 
chemistry  student.     It  is  not  unrealistic  to  imagine" 
that  much  conceptional  chemistry  on  the  atonic  and 
molecular  level  could  be  greatly  enriched  if  these 
techniques  were  available  tc  the  s'tudent. 

In  our  laboratory  we  have  been  developing  our 
computing  systems  with  this  communic^tabili^y  in  mind .39 
Our  BISON  system  is  bas^  on  the  follcjd^ig^riteria: 

(1)  Responsiveness  to  questions 'i^SI:^ ant  to  the 
non-specialized  user  posed  in  his  language. 

(2)  Reliability  or,  at' least,  a  limit  of  error^ 
given  in  the  computed  result. 


(3)    Fool-proof  bl^k-box  operation^  in  which  the 
computing  system  makes  as  many  correct  procedural 
decisions  as  possible  in  answering  a  particular  chemi- 
cal question.     Implicit  in  the  concept  of  being 
responsive  to  a  truly  chemical  question  is  that  built 
into  this  new  computing  T.nstrument  must  be  a  great  deal 
of  the  specialist* s'experience.     {However^  only  that 
experience  which  has  been  demonstrated  to  yield  reli- 
able results  should  be  built  into  the  instrument  and 
therefore  unproven  methods  should  not  be  included  in 
such  an  instrument.) 

^  The  instrument  that  we  have  designed  is  being 
built  around  BISON,  a  research  system  which  consists 
of  a  selT-con'sistent  field  program  for  laCrge  diatomic 
molecules,  a  properties  program  for'  thq  same  systems., 
a  graphical  display  program  ^which  automatically  plots 
charge  densities,  a  transition  probability 
program,  a  spectroscopic  constants  program',^  and  a 
crystal  field  integrals  program,  all  unified  into  a 
well-coordinated  operating  system  which  permat^J  the 
user  to  do  any  or  all  of  the  above  tasks. 

•  This  desigii  can  be  broken  do^m  into  the  following 
components.     (1)  The  INTERVIEW  MODULE  ^of  BISON  con\^ses 
with  the  requestor  to  ^define  his  request  more  cle^^rly, 
to  inform  him  of  the  computerv time  involved,  and  to  pro- 
vide him  with  an  estimate  of  the  precision  of  the  an- 
swer;    (2)  the  {PROCEDURAL-EXECUTIVE  MODULE  executes  a 
sequence  of  calculations  to  answer  this  .chemical  ques- 
tion;    (3)  the  COMPUTATIONA-L  MODULES  are  used  by 
Procedure  Executive  to  answer  the  chemical  question. 
The  system  is  currently  capable  of  producing  Hartrpe- 
Pock  wavefunctions  and  properties  for  heteropolar  or 
homopolar  closed  and  open  shell  diatomic  molecules  con- 
taining atoms  from  rows  1  through  5  of  the  periodic 
table.     A  wide  variety  of  excited  states  are  permitted'.. 
It  will  also  produce  wavefunctions  based,  on  the  Adams 
localized  orbital  procedure,  as  well  as  solve  the.  SCF 
equation  in  the  presence  of  an  external  point  charge 
environment  (all  necessary  integrals  are  compute^); 
(4)  MOLECULAR  DISPLAY  MODULE.     BISON  will  generate  and 
plot  contour  diagrams  of  the  orbital  and  total  densi- 
ties of  the  molecule.     An.  interesting  feature  of  this 
process  is  that  the  contours  are  hunted  out  and  plotted 
automatically  and  appear  on  35-mm  film  as  computer  out- 
put or  on  an  off-line  CalComp  plotter.     This  display 
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feature  ia. particularly  intriguing  when  used  to  wa^ch 
a  molecule^^  form  or  to  make  comparison  between  a  mi9le- 
cule  in  several  states-^,  'or  between  molecules  of  a  ' 
homologous  series^l.     This  display • feature  can  be  usfed 
with  Hartree-Fock  or  multiconfiguration  wavefunc tion^v 

It  is  capable  of  producing  by  a  priori  computa- 
tions^ electronic  energies,  potential  curves,  charge 
densities ,  spec t rose opic_  cons  tan ts^  dissociation 
energies,  many  properties  dependent  in  the  charge 
density  such  as  dipole  and  higher  moments,  ionization 
potentials  and  bond  lengths. 

The  Procedure  Executive  and  Computational  modules 
have  been  adequately  described  previously  and  we  will 
not  discuss  them  here,  however,  we  have  been  changing 
and  redesigning  the  Interview  Module  and  further     '  * 
developing  the  molecular  display  capability.     In  what 
follows^I  will  present  some  of  the  new  aspects  of 
interviewing  the  BISON  under  which  are  of  general 
Interest  in  computer  interviewing  and  in  the  last 
section  will  disucss  some  features  of  and  recent 
material  produced  by  Molecular  Display. 

The  Interview  Module 

The  latest  version      of  INTERVIEW  has  been  de- 
signed^ to  use  Argonne^s-  time  sharing  system  RESCUERS. 
This  version  is  in  a  higher  level  language  with  good  . 
facilities  for  string  manipulation^^.     The  user  inter- 
face of  this  version  was  designed  to  provide  consider- 
ably more  flexiltility  than  the  older  version39. 

After  some  experience  with  our  original  interview 
program  it -became  clear  that  a  wider  spectrum  of  users 
must  be  accomodated,  the  'interviewing^  system  must  be 
flexible  enough  to  be  convenient  to  a*  user  acquainted 
with  the  nature  of  the  data,  while  having  the  potential 
to  explain  what  data  Is  necessary,  what  form  it  can 
take,  and  what  can  be  done  with-it,  to  a  user  who  is  in 
need  of  instruction  in  these  areas.     The  system  must 
also  be  self-explanatory,  so  little  or  no  preliminary 
instruction  i4  necessary  for  the  beginner.     However,  it 
is  undesirable  for  the  frequent  user  to  be  forced  to 
tolerate  informational' messages  and.  explanations  re- 
garding aspects  of  the  system  already  familiar  to  him. 
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In  its  most  active  form,  then,  the  processor  should 
function  as  a ^ teaching  aid,  wnile  at  its  most  passive, 
it  should  be  a  research  tool/  . 

To  accomodate  users** in  both  classes,  four  distinct 
levels  of  interaction  are  designed  into  the  present 
interview  system,  ranging  from  full  prompting  and 
^explanations  to  only  a  single  message.    We  can  group 
these  levels  for  purposes  of  explanation  into  two 
modes:    novfce  and  expert.    Within  each  of  these  modes, 
the  messages  can  be  abbreviated  giving  rise  to  four 
modes.     This  basic  distinction  grew  out  of  an  analysis 
of  conversation.     In  examining  conversations,  one  finds 
that,  at  a  giv.en  instant,  one  of  the  participants  has 
control  of  the  direction  of  the  conversation  (not 
necessarily  the.  present  speaker).    This  concept  of 
, conversational  control  has  been  adopted  in  the  use  of 
these  modes.     The 'expert  mode  allows  the  user  to  enter 
data  and  make  commands  in  any  order  and  at  any  time 
he  wishes*.     The  novice  mode  allows  the  computer  to 
choose  the  area  of  discussion,  leading  through  various 
topics  in  an  orderly  manner,  developing  a  data  set  with 
the  user,  while  explaining  what  can  be  done  with  the 
data  and  how  it  is  done.'   Through  use  of 'an  abbrevia- 
tion mechanism,  the  depth  of  explanation  can  be  varied. 
Along  with  this  concept  of  conversational  mode,  how-  \ 
ever,  comes  a  unique  problem.     How- can  a  Os'er  change 
from  one  mode  to  another,  with  a  minimum  of  effort, 
once  he  has  entered  a  particular  m^)de?    This  problem 
of  ultimate  control  of  the  conversation,  which,  of 
course,  must  remain  with  the  user,  is  splved  by  imple- 
menting a  command  for  transfer" from  the  expert  to 
the  novice  mode  (namely  »NOVICE»),  but  allowing  a  trans- 
fe;r  in  the  opposite  direction  implicitly,  by  ignoring 
the  question  ?isked  by  the  processor  and  entering  a^ 
command  legal  in  ^the  expert  mode.     Initially,  the 
user  is  asked  to  -rate  his  expertise  on  a  scale  cfrom  0 
to  3,  and  is  automatically  entered  into  the  appropriate 
jnode.     Design  of  the  user  interface  attempted  to  follow 
the  user  engineering  principles  discussed  by 
Hanson^5. 

There  are  no  specific  commands  in  tl^e  novice  mode; 
only 'responses  to  the  questions  asked  are  processed, 
and  errors  in  those  responses  (^indicating  a  lack  of 
understanding)  trigger  further  explanation  of  the 
request.     The  expert  mode;  on  the  other  hand,  contains^ 
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commands  which  can  be  grouped  into  two  categories,  data 
commands  and  computation  commands.    A  data  command  is 
a  data  name  (there  exists  a  flexible,  but  formally  de- 
fined list  o^^egal  *data  names),  followed  by  a  space, 
followed  by  a  value,  optionally  followed  by  a  speci- 
fication of  units  and/or  a  spec'J.fication  of  states. 
All  irrelevant  information  is  ignored,  such  as  'speci- 
fication of  units  in  certain  cases.  Computation 
commands  ai^e  requests  for  the  calculation  *of  some  chem- 
ical property.    Suoh  commands  are  simply  passed  on  to 
BISON  in  the  appropriate  form.     In  additioji  to  these 
♦commands  are  systems  commands,  which  can  be  invoked  at 
any  point  in^the  program,  and  which  are  either  commands 
known  to  the  rescue  system  or  direction^  to  the  pro- 
cessor, a  few  of  Vhich  are  described  below. 

If  the  user  enters  the  command  'HELP',  operation 
of  the  processor  is  temporarily  suspended,  and  the 
'-help'  processor  is  entered.    The  'help'  processor  will 
.  allow  the  user  to  access  information  regarding  such 
'  things  as  processor  structure,  legal  commands,  their 
ih^aning  and  usage,  and  other  generally  comforting  ex- 
planations.    This  is  a  standard  part  of  all  processors 
in  the  Rescue  system. 

'CHECK'  is  a  command  useful  in  the  expert  mode. 
It  examines  "whatever  data  is  presently  being  created, 
and  informs  the^user  of  any  logical  inconsistencies  or 
insufficiencies.    Th*e  user  can  then  anticipate  certain 
,  errors  before  costly  <5omputing  time  is  wasted.  This 
function  is  built  into  the  novice  mode. 

'FINISH'  performs  first  the  'check'  operation,  and 
if  no  problems  are  detected,  writes  a  Rescue  file  with 
the  x:reated  data,  which  tan  then  be  submitted  to  BISON. 

'LIST'  lists  the  data  So  that  it  can  be  scanned 
visually  for  errors  wAich  cannot  be  detected ^by  the 

trogram;  such  as  an  incorrect  value  or  a  typographical 
rror. 

In  addition,  there  are  several  commands  which  cal- 
culate such  things  as  the  cost  of  calculation,  the 
accuracy  of  resu]^ts,  and  the  amount  of  computing  time 
necessary, 

* 

All  commands  can  be  prefaced  by  the  keyword  'NO', 
which  will  per^form  the  opposite  of  the  operation* 
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normally  indicated  by  the  following  command.     For  ex- 
ample, the  input  line  »N0  IONIZATION  POTENTIAL^  would 
counter  any  previous  command  to  calculate  the  ioniza-v 
tion  potent.ial.    When  the  command  *N0*  is  irrelevant, 
it  is  ignar*ed.    Since  commands  are  recognized  by  scanning 
a  table  of  keywords,  they  may       specified'' by  the,  mini- 
mum number  of  letters  (consecutive  from  the  beginning 
of  the  word)  necessary  to  distinguish  them  from  other 
keywords.'   In  most  cases,  this  number  is  not  greater 
than  three,  and  often  one  letter  is  sufficient.  Thus, 
in. the  above  example,   » NO  10 »  would  have  been  suffic- 
i^t.     Some  commands  require  the  user  to  s-pecify  the 
relevant  state  of  the  molecule.     During  execution,  the 
syatem  remepibers  a  set  of  one  or  more  current  active 
stages.     These  apply  by  default  if  no  state  is  speci- 
fiedVfor  a  command  which. requires  one.     The  default 
is  changed  when  a  user  specifies  a  set  of  states  in  a 
command,  or  he  enters  a  » STATE »  ^command.     In  this  way, 
data  and  computation  commands  which  are  common  to  more 
than  one  state  need  not  be  entered  a  large  number  of 
times. 

Using  these  commands  and  the  conversational  struc- 
ture, the  processor  is  capable  of  both  creating  a  con- 
sistent data  set  suitable  for  BISON,  and  serving  to 
explain  the  function  and  form  of  the  data  to  a  user 
'who  is  unfamiliar  with  the  potentialities  of  the  BISON 
system. 

c  46 

Molecular  Display-^^^ 

4 

One  of  the  most  intriguing,  novel  and 'experiment- 
ally inaccessible  features  of  the  a  priori  mathemati- 
cal modeling  of  'chemical  processes  is  its  ability  to 
yield  accurate  .electronic  charge  densities  and  their 
change  at  arbitrary  magnification  for  an  atom,  (Chart 
1),  molecule  (Charts  3  and  M)  or  during  a  process 
such  as  molecular  formation  (Charts  2  and  Films),  ioni- 
zation, vibration,  or  chemical  ruction  (Chart  5). 
Currently  the  BISON  system'  plots/feques ted  contour 
values  of  electronic  charge  det^sity  on  a  calcomp 
plotter  on  5  millimeter  film  from  the  computed  waVe- 
function.    Also,  one  may  plot  interactively  on  the  2250 
utilizing  a  precomputed  library  of  atomic  and  molecular 
wavefunctions.     The  interactive  systfem  can  be  used  to 
carefull^^and  quickly  design  and-layiout  sequences  and 
charts  for  educational  use.     Such  cornputer  produced 
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plots  have  b^en  used  in  a  series  of  educational  films 
and  wall  charts^l*    Further  such  filmS 'and  charts 
covering  different  aspects, of  atomic  and  moleculap^ 
structure  and  chemical  processes  are  being  developed  as 
the  accurate  wavefunctions  describing  the  system. are 
computed.    We  are  currently  experimenting  with  alter- 
native ways  of  presenting  wavefunctions  and  electronic  • 
charge  densities.    Among  these  are  cl'ouds  of  perspec- 
tive dots,  Shape  diagrams  (a  surface  of  constant  value), 
a  ra-ster^color  TV  system  driven  by  a  stand  alone 
computer^' ^ 

It  remains,  however,  a  challenge  to  strike  a- bal- 
ance between  asthetlc  appeal  atnd  quantitative  infor- 
.motion  content.        ^    ,  ^ 

Workshop  in  Chemistry  .from  Computers 

,  /  We  have  used  the  BISON  System  and  the  above' con- 
cepts in  a^  "Cjiemistry  from  Computers"  workshop  held  at 
^Argonne,  1970  and  1971.     This  wqrkshop  involved  a 
series  of  introductory- lectures  and  then  use  of'BISCN 
by' the  participants  on  an  actual  .problemV-  The  partic- 
ipants consisted  of  college  faculty  and  were  not 
theoretical  chemists/  .  Two  projects  were  completed. 

1)  To  e:valuate  the  dissociation,  energy  and 
various  properties,  for  the  ground  state  of  the  LiH  < 
p'rolDi^m  utilizing  the  OVC  method. 

2)  To  present  qualitative  and'  quantitative  evi- 
dence as  to  why  the  chemical  bond  in  ^2"^  is  stronger 
than  the  bond  in  Li2  although  a  bonding  electron  has 
been  removed.  Tlnis  was  accomplished  through  the  visual 
analysis  of  charge  densities  sind  difference  densities 
and  also  on  the  basis  of  various  contributions  to  the 
total  energy  of  the  systems. 

The  participant  response  to  this  workshop  was 
enthusiastic  ax\d  it  was  encouraging  that  in  1  weelc 
with  properly  designed  tools  non-specialists  could 
successfully  attacK  on  an  operational  level  complex 
quantum  mechanical  calculations. 

Summary 

The  relatively  new  ability  of  a  priori techniques 
for  approximately  solving  the  Schro'Hinger  equation  to 
an  accuracy  which  is  capable  of  providing  reliable 
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chemical  information  independent  from  experiment 
stimulates  the  development  of  effective  means  of 
interfacing  these  techniques  with  the  non-specialist 
and  the  student.    When  such  interfaces  are  coupled' 
with  the  ability  to  display  the  details  of  simple 
chemical' processes  on  the  atonic  and  molecular  level, 
a  useVul  teaching  aid  emerges  which  could  replace  or. 
complement  laboratory  experiments  for  both  student 
and  working •chemist. 
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MO  Theory  as  a  Chemical  Tool 

Michael  J.S.  Dewar 

Department  of  Chemistry,  The  University  of  , Texas  at 
Austin,  Austin,  Texas  78712 

Introductioh 

If  quantuin  'theory  is  to  be  used  as  a  practical  aid 
to  organic  chemistry  by  providing  quantitative  estimates 
of  molecular  structure  and  chemical  reactivity,  the 
resuilts  must  not  only  be  of  sufficient  accuracy  but  must 
also  be  achieved  without  iilldue  cost  for  molecules  of 
reaaoncUDle  size. 

Since  the  degree  of.  accuracy  required  is  far  out- 
side the  reach  of  purrent  ab  initio  methods,  the  only 
hope  lies  in  some  semifeiiipirical  approach.     Either  some 
approximate  and  inherently  inaccurate  ab  initio  treat- 
ment may  be  shown  by  experiment  to'  give  results  of 
sufficient  accuracy,  due  to  a  fortuitous' cancellation 
of  iarrors,  or  sufficient  accuracy  may  be  achieved  by 
introducing  parameters 4 into  the.  treatment.     In  either 
case,  the  treatment  must  be  thoroughly  tested  by  com- 
parison with  experiment  before  any  confidence*  can  be  i 

placed  in  its  use  for  predicting  chemical  behaviour. 

2  . 

A  minimum  condition  that  must  be  -met    is  tjiat 
the  properties  of  stable  molecules  should  be  re- 
produced with  "chemical"  accuracy,  i.a.  .  ^ 

(a)  heats  of  atomization  to  the  order  of  ±1 
kcal/mole; 

o 

(b)  bond  lengths  to  the  order  of  ±0.01  A; 

(c)  bond  aingles  to  the  order  of  ±1°; 

(d)  force  constants  to  the  order  of  1%. 

Jf  the  results  are  to  ne  meaningful  it  is  also  neces- 
sary to  calculate  molecular  geometries  completely;  use 
of  assumed  pr  experimental  geometries  can  lead  to  large 
(in  a  chemical  sense)  errors  in  the  energy  for  mole-  ^ 
cules  of  quite  moderate  size.     The  equilibrium  geo- 
metry can  be  inferred  only  by  interpolation  from  cal-^ 


culations  for  numerous  iadividual  configurations,  at 
least  several  hundred  even  for  a  simple  molecule  such 
as  methylcyclohexane  and  at  least  several  thousafnd  to 
define  the  reaction  path  and  transition  state  for  a 
reaction.     Unless  *each  calculation  can  be  carried  out  * 
in  seconds  rather  than  minutes,  hours  or  days,  the 
cost  of  the  calculation  will  be  prohibitive  in  relation 
to  its  chemical  value.  ' 

These  considerations  exclude  the  "ab  initio  SCF" 
approach.    Not.dnly  are  the  results  too  inaccurate  but 
the  cost  of,  qairrying^  out,,  the  calculations  properly, 
i.e.'  with  optimization  of  geometries,  is  wholly  exces- 
sive and  will  remain  so  in  the  foreseeable  future. 
Artificies  which  ^reduce  the  computation  time,  e.g.  the 
use  of  assumed  geometries  or  very  limited  basis  sets, 
make  the  errors,  which  are  already  too  great,  even 
greater.  • 

The  only  hope  for  success  at  the  present  time 
'therefore  lies  in  the  use  of  a  semiempirical  treatment, 
based  on  a  still  cruder  approximation  to  speed  up  the 
calculations  and  fortified  by  the  introduction^of 
parameters  to  increase  its  accuracy.    Thjs^  success  al- 
ready achieved  with  a  very  preliminary  treatment  of 

2  3  • 
•this  kind  fMINDO/2  '   )  seems  to  leave  no  dpubt  con-  ^ 

cerning  th-e  feasibility  of  such  an*^approach.' 

All  treatments  so  far  suggested  c|,re  based  on  . 
simplified  versions  of  the  R6oth^ah  method  in  which 
integrals  involving  diatomrci^differentxal- pverlap 
are  neglectec^,  and  also  a  varying  number,  of  integrals, 
involving  irionocentric,  overlap  ((Sndo^  ,  INDO      PNDO  ,  * 
NDDO  )  .   ,We  have  'so  far  mainly  concentrated  on  the 
INDO^  approximation,  parametrized  to ^ give  correct 
heats  of  atomization.  and  molecili4r  geometries  rather 
than  to  mimic    the  .  results' of  "-^'b  inifeio  SCF"  cal- 
culations^    To  distinguish  this  approach. from-  the 
origina?!  INDO; -method,  we  have  termed  it  MINDO  (Modi- 
fied IN0(3)v;   The  parametrization  proved  unexpectedly 
diff^puii:  since,  the  parameters  are  so  ifiterrelated 
that  lt:'*is^impossible  to  predict  intuitively  the 
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result  of  a  given  variation  in  any  one  of  them; 

initiially  we*  were  able    only  to' reproduce  energies  of 

\  *  '  '  6  7 

molecules  using  assumed  geometries   (PNDO  ,  MINDO/1  ) . 

2 

Eventually  we  devised    a  computer  program  to  optimize  ^ 
parameters  automatically;  use  of  this  led  to  a  treat- 
ment (MINDO/2)  which  gave  gopd  estimates  of 'geometries 
heats  of  atomization,  force  constants,  ionization 
potentials,  dipole  moments,  and  nuclear  ^adrupole 
coupling  constants  for  a  wide  variety ''bf'  molecules  con- 
taining the  combinations  of . elements  CHON,  CHF,  or  CHCl. 
Thi^  initial  treatment  suffered  from  a  number  of  de- 
f;Lciencies  most  of  which  have  been  removed  in  a  new 
version  which  is  almost  comple^te;  we  have  also  success- 
fully, parametrized  the  other  approximations  noted  above 
(GNDO,  PI^DO/nddP).     We  believe  that  a  properly  para- 
metrized version  of  NDDO  will  provide  the  optimum^ 
solution  in  terms  of  accur'acy  and  computation  time;  at 
present  the  computation  time  is  too  long  by  a  factor  of 
ten,jsmall  'enough  to  be  removed  by  forseeable  improve- 
ments in  comp;d[€ers.'^    The  v^ork  reported  below,  refers  to 
qalculatiojfe  by  MINDO/2 •     It  should  be  emphasized  that- 
its. main  value  lies  in  the  promise  ot  more  to  come 
rather  than  present  achievements  since  we  already  have*, 
better  parame1;ers  available  •     Nevertheless  the  results' 
Obtained  are  already  of  some -cheijtiiey!*  interest . 

The  calculation  of  molecular  geometries  has  been 
carried  out  using  a  program  written  by  Dr.  A.  Broittn, 
based  on  the  SIMPLEX  mft^h-od^.     In  orcjer  to  optimize  the 

geometry  of  a  system  defined *by  n  independent " coordi- 

2  ' 
nates,  approximately  n    calculations  are  required •  This 

^sult  can  also  certainly  be  surpassed •     Reaction  paths 

^re  usually  calculated  by  taking  as  reaction  coordinate 

some  dimension  -of  the  reacting  system  (usually  an  " 

interatomic  di^^nce  or  angle)  which  changes  monotoni- 

cally  during  the  reaction  and ' minimizing  the- energy  of 

the  system  with  respect  to  the  other  coordinated  for  a  f 

series  of  values  of  the  reaction  coordinate.     As  we 

shall  see  presently,,  'this  procedure',  which  has  the 

blessing  of  most: -standard  texts,  on  reaction  kinetic7,  x 
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sometimes  fails  in  an  interesting  and  mechanistic^ly 
significant  manner.  When  it  succeeds,  the  maximum  in 
the  plot  of  energy  vs  reaction  coordinate  corresponds 
of  course  to  the  transition  state. 

Results 

Some  typical  results  given  by  MLNDO/2  for  various 
chemical  problems  are  summarized  below.     For  a  more 
detailed  review,  see  ref.  (1). 

A.     Geometries  and  energies^     Comparisons  of 

o 

calculated  (observed)  bond  lengths   (A)   and  heats  of 
atomization  for  some,  simple  molecules  are  shown  in 
Figure  1.     The 'heats  of  atomization  have  for  con- 
venience 'been  converted  to  heats  of  atomization  (AH^; 
kcal/mOle  at  25°C)   using  experimental  values  'for  heats 
of  formation  of  atoms.  'The  results  are  typical  fqr 
the  many  moledules  yte  have  studied;   large  deviations 
occur  only  in  small  ring  compounds   (^g  cyclopropane 
and  cyclobutane  where  the  strain  energies  are  under- 
estimated sometimes  by  as  much  as  25  kcal/Jnole)  and 
compounds  containing  two-^dj^cerit  heteroatoras  (eg 
peroxides  and  hydra-2i?tes)  where  the  corresponding 
bond  length  is  Jroo  short  and  AH^  much  too  negative. 
We  feel  certain  that  the  former  problerrt  can  be  over- 
come ;  the  latter ,  however,  is  probably  an  inevitable, 
consequence  of  the  XNDO  ,approxi1tt^|^on ,  due  to  it^ 
neglect'  of  certain  integrals  involving  one-center  over 
lap.- 

"       (1'534)  ,  (1.338)        ^       ,    •    ■  . 

AH^  -24. 7  (-20'.  2)*  .   13.-4(12.5)         .   '  53.4  (54.3) 
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^2C«S«     1.455(1.467)         H^CSJCH-^KCH  (CH^)2l*-CH3 

1^9  ^ru^ru  1*323     1.404     1,191  . 

(R43)  ^"«^"2  (1.34)    (1.44A)C1.20)  •  / 


30.9(26.3)                            67.3(69)  3.3{-5.2) 

1 

1.332  1.334  1,224              L°~  ^'C       /  Lo— •  N— oj 

(1.437)    (1.-334)    (1.200)     1.239         •  0  /  1.173'(1.189) 

(1.241)  • 

-84.5(-81.0)                      '       -95.8(-89±5).    •  "  -10  .-9  (-8 . 1) 


—  C-..^  0  1.162(1.189)  1.278    •  1.187 

1   TftQ  (1.2»4)       (1.168)  • 

1.091  1.308 

(1.083)  (1.304-P'-^^^' 

-20.4(-14,6)  -94. 0(^91.4)       -   -41,9(-23.4;  -47.4) 


Figure  !•     Comparison  of  calculated (observed) "  heats  of  formation 
(^H^)  (kcal/mole)  and  bond  lengths  (A)  •  The 
"calculated"  values  of  AH^  were  estimated  from  MINDO/2 
heats  of.  atomizatioHr  using  experimental  values  for 
heats  of  formation. of  atoms. 

B.   ^'Force  constants.    Table  I  'shows  a  comparison  of 
calculated  and  observed  force  coi>stants  f or  stijetching  of 
individual  bonds  in  molecules.    The'  errors  are  again  less  by 
at  least  an  order  of  magnitude  than  those  given  by  other 
methods .       » '  * 


Table  I..  ;Calculated  (Observed)  Stretching  Force  Cons»t2[nts 


Molecule 

Bond 

V 

Force 
calcd. 

constant 
i   ■  obsd'. 

CH^ 

•  CH 
CC 

5.8 
4.9 

5.0 
4.5' 

CH 

5.7 

4.8 

1 

■C2«4  . 

CC 

CH  w. 

'aJ^'I  5,7' 

9.6  \^ 
•  5.1 

f 
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Table  I.  (continued) 

• 

Molecule 

Bond 

Force 

dX  • 

constant 

obs  d  • 

C2H2 

X  ^  •  JL 

1  Q  Q 
1  D  •  0 

(\  1 
0  «  JL 

^  ft 

HO                       .  ' 

HO  ' 

XU  •  X 

7  ft 

CO 

X  0  • 

NH, 
•J 

.  NH  , 
/ 

NO 

7.7  , 

6.4 

14.4 

■  11.4 

'C>     Car-boniunr  ions>     A  comparison^  of  Calculatec^ 
(observed)  heats  of  atomizatign  (as  AH^,  kcal/mole)  for 
some  carbonium  ions  is  shown  below.     The  experimental  values 
.afe  from  photoionization  studies^^.     Apart  from  CH^^,  the 
agreement  is  good. 

CHjj  CH2CH2+.  CH3CMCH2       CH2CHCH2CH2  (CH^Oj^C 

2  76  (  2  6  0)         ■     225(219)  192  (190)         183(184)  171(173) 


..•The  calculated  energies  ofi  various  isomers  of  protonated 
cyclopropane  are  as  follows: 


4 

CH.-CH-   .  CH^-SSiCH^      •         »"2     +    .^"2     ■  CH'S-^'CH- 

2.       2.          •        .  i  i  'H' 

216  195       .  188  248 

The  njpropyl  cation  is ^predicted  to  be  unstable,  isomeri^ing 
directly  "to  the  edge-protonated*  isomer.     These  conclusions  diff^ 
from  those  of  Pople  et  al^^,  based  on  ab  initio  SCF  calculcitions; 
however  they  s        to  be  supported  by  the  available  evidence.  The 
solvation  energies  of  n-Pr    ^nd  i-Pr    must  be  glreater  than  those 
of  the  "nonclassical'''  ions;  therefore  unless  the  latter  lie  well 
belpw  n-Pr    in  energy,  they  would  not  be  formed  in  solution  from 
n-Pr*^.  ,  C"^^* 

"**  *     12  • 

The  7-norbornadienyl .  cation  and  anion  are  .'pre^dicted      t(3  - 
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have  the  following  unsymmetrical  structures;  these  would  be 

expected  on  the  basis  of  attractive  and  repulsive  interactions 

betweert  the  c"^  and        centers  and  the  double  bonds.     The  ' 

barrier  to  "bridge  flipping"  in  the  cation  is  predicted  to  be 

13 

24     kcal/mole;  NMR  studies      confirm  the  distorted  structure 
and  fix  a  lower  limit  of  ^22  kcal/mole- to  the  "f lipping*'  barrier. 


Calculations  have  also  been  carried  out  f6r  a  number  of 
other  carbonim  ions  with  generally  satisfactory  results  and 
for  related  species   (e.g.  carbenes) . 


D,     Energetics  of  some 'simple  reactions.     Table  II  gompares 
calculated  '  .  .  and  observed  activation  energies  fqr  twisting 
about  the  C=C  bonds  in  ethylene  and  the  cumulenes..'    The  agree- 
ment is  satisfactory,  whereas  the  "ab  initio  SCF"  method  gives 
values "^^  that  are  too  lar^e.    A  number  of  6ther  ^torsional  bqirrl^^rs 
have  also  been  calculated  with,  comparable  success. 


Table  II. 
Molecule 


Activation  energy,  for  Twisting  about  C-C  Bond^.' 

^Barrier  to  ro*tation  (kcal/mole)* 
calccy.  MINDO/2  obs,  calcd.  ab  ^§itip 

scf 


H2C 

H2C— 

H2C=C=C=CH2 

H2C=C-C-C-CT2 

H2C=C=C=C=C=CH2 


5  3. .5 
36.7 
31.6' 
24.8 
22.0 


65.0 


30  , 


138 
92 
74 


20 


The  relative  energies   (kcal/mole)   for  the  chair  and  boat 

'        '  , 

conformers  of  cyclohexane,  and  of  three  possible,  transition 
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states  for  their  interconversion ,  are  as. follows: 


16 


(0) 


fl3.D 


The  calculated  heat  of  isomerization  agrees  very  well  with 

experiment   (5.3  kcal/mole) .     Isomerization  is  predicted  to 

occur  via  the  first  transition  state,  in  agreement  with 

17 

calculations  by  Hendrickson     ,  using  the  Westheimer-Allinger 

method.     0ur  value  for  the  activation  energy  is  less  than 

18 

that  observed  (10.3  kcal/mole     )  by  about  the  same  amount 

that  Hendrickson's  value  (12.7  kcal/mole)  was  too  high. 

Several  other  reactions  have  been  studied  with  comparable 

success,  including  hydrogen  abstraction  from  methane  by 
2 

methyl,  radical  ,  inversion  and  bond  exchange  in  cyclopcta- 

19  *  20 

tetraene,       carbon  atom  insertion  into  double  bonds, 

isomerization  of  melfhyi  isocyanide  to  acetonitrile , i'nr 

sertion  and'  addition  reactions  ^of  singlet  and  triplet  carbene 

:art 
13 


and  difluorocarbene,  ^ ' ''■^  and  the  structure  of  the  Favorskii 


intermediate . 


E.     The  Cope  rearrangement.     The  Cope* rearrangement  of 
1, 5-hexadiene  (A)   into  itsjelf  oan  tcike  place  via  a  chair- 


21 


(B)  or  boat-(C)   type  transition  state 
relative  energies   (kcal/mole)   are  shown  below. 


The  calculated 


22 


B  . 


or 


i — ^ 
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I 

22 

B  IS  ccrrrectly     .predicted  to  be  preferred  and  the 
22 

calculated      difference  in  energy  between  B  and  C  (6,6 

21    '  * 

kcal/mole)  agrees  well  with  that  observed      (5.7  kcal/mole) • 

Fiirther  studies  showed  that  this  difference  is  due  to  am  anti 

bonding  interaction  between  atoms  a  and  b  in  c,  as  would  be  . 

23  — 

expected  from  Evans'  principle. 

Calculations  were  also  carried  out  for  chair  transition 
states  D  and  E  with  a  methyl  substituent  in  a  pseudocixial  (D) 
or  pseudoequatorial  (E).  position.     E  was  predicted  to  be 

O  A 

favoured,  in  agreement  with  experiment     ,  and  the  calculated 
difference  in  energy  (2.0  kcal/mole)   agreed  well  with  that 

O  A 

(1.5  kcal/mole)  observed. 


D 


-7 

i  I 


•  These  calculations  thus  reproduced"  the  irteiin  results  ^f 
two  extensive  experimental  studies  at  ,a  small  fraction  of  the 
cofet. 

25 

The  calculations  have  been,  extended      to  reartangements 
in  the  bullvalene  series:     as  th^  following  results  show  the 
calculated  activation  ener'gies.  rkcal/mole)  agree  quite  veil 
with  experiment  (in  parentheses) . 


<9 

^  H 


The  barrier  in  semibullvalene^  (H)  itself  is  too  small  to 


*The  estimated  cost  ot  the  calculations,  jtncluding  sa^laries, 
overhead,  and  qost  of  computer  time  at  the  full  commercial 
rate  ($750  per  hour),  was  $5000. 
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2  6 

measure.     That -in  octamethylbullvalene  is  6.4  kcal/mole 

A  detailed  study  of  our  MINDO/2  results  by  .energy  parti- 
27 

tioning      indicates,  however,  that  the  methyl  substituents 
should  increase  the  barrier  to  rearrangement  in  H. 

-  28 
F.     Electrocyclic  processes/    Current  studies      of  the 

electrocyclic  ring  openings  of  cyclopropyl  ions  and  radidal 

to  allyl,  e.g. 


A 


and  of  cyclobutene  to  butadiene, 


□ 


^ — ^  ., 

K  L 
have  led  to  some  interesting  conclusioijs  and  predictions. 

Table  III  shows  calculated  activation  energies'  for  con^ ♦ 
rotatory  and  disrotatory  opening  of  the  cyclopropyl  systems, 

the  apical  angle  9  ia  I  being  taken  as  the  reaction  .coordinate-, 

-  29  -    *"  ' 

The  catioh  is  correctly      predicted  to  ^'pen  by  a.  disj;"Otato^y 

path  and  .ifhe  anion  by  a  conrotatory  one,  the  differences  in 
activa^tion  energy  being  large,.     Two  unexpected  features  are 
the  prediction  that ''opening  of  the  radical  should  be  dis- 
rotatory ,and  almost  as  stereospecif iq  as  that*  of  the  cation, 
and  that  the  reaction  of" the  cation  should  require  activation. 

The  former  prediction  disagrees  with. that  of  Woodward  and 

^         30    '  '  . 

Hoffmann      while  the  latter  seems  to  be  at  variance  with' 

31 

experimental  evidence      that  con'certed  rearrangement *of  I  to  J 
assists  solvolysis  of  cyclopropyl  esters.     Th^  latter  dis- 
crepancy w^s  resolved  by  a  demonstration  tl\at  ^pyxam^idaf  cyc5io- 
propyl  cation,  corresponding  to  the  nascent  ion  in  the  solvolyti 

reaction,  rearranges  without  activation  by  tHa  pathj  shown  to  be 

•  .     '     '      '         .  32  • 

followed  by  Schleyer  et  al..     The  available  evidence ^ ^   seems  to 

support  our  prediction  that  I  qhould  be  a'  stable  species  if  it 

is  set  free  in  a  planar  form.  ^  \  *  . 
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Table  III.  Calculated  Activation  Energies  for  Electrocyclic 
 Reactions  of  Cyclopropyl  


Compound 


Calculated  activation  energy  (kcal/mole) 
disrotatory  conrotatory 


0- 


7.5 
65.7 
24.7 


38.0 
30.7 

52;  3 


The  electrocyclic  opening  of  cyclobutene  (K-^L)  is  predicted 
to  take  place  conrotatorily >  in,  agreement  with  experiment.  In"' 
this  case  the  calculations  for  the  "forbidden"  disrotatory* 
reaction  led  to  a  remarkable  result.  When  the  bond  length  r  in 
K  was  taken  as  reaction  coordinate,  a  plot  of  energy  Vs  if  h^d 
the  unexpected  form  shown  in  Figure  2.     Neither  the  .forward  nor' 
the  backward  reactions  took  place  I    Evidently  there  are  two 
potential  minima  for  each  value  of  r ;  one  corresponds  to  a 
geometry  analogous  to       with  the  CH2  groups  perpendicular  to 
the  C^  unit,  while  in  the  other  all  the  atoms  are  dopla^ar,  as 
in  L;  .  * 


E 


Figure  2.     Plots  of  E  VjS  r  for  electrocyclic  opening  of  K  (— ) 
and  closure  of  L(  ) 

This  result  is  of  course  easily  explained  in  terms  of  Evans* 
f'.     .  24 

principle     .     Peticydlic  reactions  involve  a  cyclic  interchange 

*.  * 
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of  bonds,  analogous  to  the  interconversion  of  Kekule  structured 
in  benzene; 


If  the  transition  fetate  i§  aromatic,  a  hybrid  c^f  the  two 

structures  i^ 'more -stable  than  either  so  *the  reaction  involves 

.a  sing  Id  potential.vminlmvim.'    If  antiaromatic ,  there  are  two. 

valleys  -in  the  potential . surface ,  . separated  by  a  ridge,  since^ 

■>  * 

intermediate  hybrid  structures  have  a  higher  energy.  Similar 


behaviour  shoAild  be^hown,  by.  other  reactions  involving  anti- 
aromatic     .transition  states;,  this'  has  been  so  in  ail  cases 
we  have  studied  (e.g.*  the  ^' forbidden*^  reactions  in  Table  III 
and^  the  cis-cis  dimerisatioh  of  ethylene  to  cyclobutane) . 
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COMPUTER -ASSISTED  STUDY  OP  LIQUID  WATER 

^'  '  -      ■  by 

Prank  H.  Stillinger 
Bell  Telephone  .Laboratories,  Incorporated.    /  - 
Murray  New  Jersey  07974 

•  .    ^        *   On  a'ccount  of  its  pre-eminent  importance  to  many 
ifid  diverse  branches  of  science,  water  has  remained  the 
object  of  intense  theoretical  study  fpr  many  years.  In 
the  liquid  phase,  th4.s  substance  exhibits  rather  anomalous 
properties/  not  the  least  of  which  is  its  apparen|;;ly  unique 
abilit^y'*^o  act  as  a  fljiid  medium  for  life  processes..  '  In 
broad  terms  therefore  the  thrust  of  theory  consists  in 
3l)^w.ing  .how  the  structure  of  iy^ivfdual  water  molecule's 
irhplies  anomalous  behavior  for  their  aggregates. 

.   Viewed  from  the  modern  standpoint;  two  tasks  must 
be  accomplished  in  sequence.     First  it  is  nepessary  to 
determine  the  nature  of  interactions  between  .water  molecules 
I.e.  the  forces  and  torques  that  neighbors  exert  on  one 
another.     Subsequently,  these  interactions  must  be  employed 
to  infer  the  way  that  molecules  in  the  liquid  arrange  them- 
selves next  to  each  other,  and  the  macroscopic  properties 
(■density,  viscosity,  dielectric  constant,  solvent  behavior, 
etc.)  vihlch  result  froni  tlhro^,^. --arrangements .  *  The  specific 
disciplines  upon  which  one  must  di>aw''Jpor  satisfactory  '  ^ 
quantitative  .ans>/ers  In  '  these. ^respe^ctl vfe  -Area's  are  molecular 
-quantum  theory,  and  statist  leal- m'ecl^&nl'cs . 


In  .the  past,  theoreticians  have  not  possessed  * 
sufficlBntiy  powerful,  tools  to  achieve  comprehensive  und'er-. 
standing  in  either  a]^ea.    As  a  result,  thje  proposed  explana- 
tions.  Tpr  the  nature  of  liquid  water  have  been  highly  , 

speculative  and  diver^se.     II:  is  hardly  Surprising  that 

\    ^  .  . 

contradictory  pictured  h^ve  arisen  concerning -the  nature  of 

local  moieciula-r  order,  in  liquid  water.    By  hindsight,  one         '  ' 

is  inevitably  reminded  of  the  famous  fable  about  blind  men 

and* the  elephant.    .      ^    ^  .  / 

Very  recently *both  the  molecular  quantum  mechanics 
and  the  statistical  mechanics  required  for  a  full  deductive 
theory  of  liquid  water  have  unSlrgofie  significant  advcinces. 
These  advajnces  have  become  possible  through  the  wide    *  ^ 
availability*of  rapid  electronic  Qomputers .     In  the  flr^t 
instance,  all-electron  Hartree-Pock  computations  have  ome 
feasible  for  pairs  and  triplets  of ^ water  molecules,'  to  permit 
accurate  determination  of  interaction  potentials.  ^  Subsequently > 
molecular  dyniimics  simulatlpns  based'  upon  those  interactions 
have  been  carried  oujj  on  small  ^samples  of  lic^uid  water. 

Both  the  qtiantum-mechanical  calculations  and  the 
statistical-mechanical  simulations  have  imgressjive  powers  of  * 
prediction  in  areas  not  currently  accessible  to  experiment.  • 
One  can  determine  the  extent  to  which  the  potential  is  hon- 
additive,  and  the* degree  to  which  that  non-additivity  depends 
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on  hydrogen-bond  pattern;    In  the -liquid,  it  is  po-ssible"- to 
examine  the  detailed  architecture  of  hydrogen-bond  pattei^ns^, 
and  to  establish  the  rate  of  bond  disruption. 

The' molecular  dynamics  in  particular  has  .a  spec^i-al 
pedagogical  value.  'Molecular  configurations  generated  ' 
during  the  course  of  the  computer  dynamical  .run  have  been 
rendered  into. the  form  of  stereo  photographs  which , may  then 
be  viewed  dire.ctly.    Not.  only  doe^  this  visualizJation  capacity 
demonstrate  the  inappropriateness  of  earlier  water  models 
b^sed  upon  disordered  crystal  structures}  it  also  leads  one 
to  formulate  novel  quantitative  questions  to  pdse  to  the 
computer .  *  '  ^         "  ' 

Thus  emboldened  by  significant  advances  in  under- 
standing pure  water,  we  are  now  engaged  in  devising  strategies 
to  study  simple  solutions,  and  in  the  longer  run  to -sturdy 
complicated  aqueous  systems  of  biochemical  iaterest. 
Prospects    for  major  success  -in  th^e  foreseeable  futu^e^  will 
be  discussed .  .  . 
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1 ,  Jntroduction 


In  elucidating- various":properties  of  the  classical 

anharmbnic  solid  (a  v-dimensional  lattice  system  of  interacting 

•  * 

particles  \!(^ith  purely  repulsive  potentials)  tjje  central  problem 

is  the  solutiorNaxa^roximation  of  the  canonical  N-particle 

.     ■•  •   .  ■        ■  (' 

configur^tional  integral.    A  very  powerful  approach  fs-^the  use 
of  the  cell-clu!ster  scheme  which  is  a  systematic  procedure 

for  constructing  an  asymptotic  series  approximation  to  the 

V 

N-bqdy  partition  .function,  Q^^  [1].    For  small  N,  in  the  close- 
packed  limit  (high  density),  it  is  possible  to  .construct  regions 
of  configuration  space  (convex  polytop^)  the  content  or 
vN-dimensional -volume  of  whiph^Jnay•  then  be  ascertained.  The 
integration  procedure  is  able  to  be  formulated  in  such  a  way 
that  the  .cell-cluster  contributions  of  any  oi^der  may  be  deter- 
mined  exactly  for  limiting  (cum  linearized  bounds)' polytopes . 
Because  of  the  incredibly  large  number  of'  pair  interaction 
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cbnf igurations  for  N  >  5  such  an  approach  to'  the  many-body  prob 

23  -  ^  ' 

lem  (we  r^quire^*  =  ,0^10    ))  quickly  exceeds  the  capacity  of 

cor*puters|  currently  available.    Fortunately,  the  cell-pluster 

scheme  slows  promise  of  convergence  for  small  N.    In  fact, 


converge: ice  has  been  demonstrated  for  one-dimensional  particles 
While  this  has  not  been  substantiated  for  higher  orders,  we 


presume 


:onVergence''does  exist  for  2  and  3  dimensional  systems. 


Whereas -the  linearized  analysis  gives  rise  to  exact 
•  *•  ' 

re3iilts  tlhrough  each  order  for  regular/defect-free  lattice' sys- 
«  *  ,  •  '  * 

terns,  for^ itaperfect  lattices  (those  that  are  irregular,  dis-' 
torted  and/or  contain  s-dimensional  defects)  a  curvilinear 

9 

polytope  linearization  may  still  be  carried  out.  However,  such 
aa  analysis  then  yields  a  bound  to  exact  results  [2] .    It  is 


with  such  bounds  that  we  deal  in  this  paper  for  they  are  the 


result  of  considering  equilibrium  lattice ^models  fqr  fracture 
and  other  materials-failure  phenomena.    We  believe  that  such 

4 


technologically  important  sajad-state  phenomena  are  many-body 
cooperative  events  which  may  be  .charaxiterized  to  f irst  ^der 

I  ' 

by  classical  statistical  mechanics  using  a  lattice  model  with* 
particles  interacting  via  purely-repulsive  potentials/  are 
specifically  J^ntdrestejl  in  the  properties  of  impe:ffeQt,  dis- 
ordered lattices  for  which  we  seek  a  ineasure  of  relative  lattice 
stability.    Within  the  cell-cluster  formalism  "such  a  measure  is 
directly  provided  by  a  change  in  Helmholtz-'free*  energy  iil-  goin^ 
from  a  perfect  lattice  to  a  specific  disordered  conf i^guratioh* 
Alternatively/  it  is  possible  to  employ  ^the  Born  stability 
critetia  for  various  lattice  conf iguratiCHis  .and  deformations  [3] 
For  each  >lattic^/sp|iere  system  we  consider  the  set  of 
all  distinct  nearest-neighbor  pair  interactions,  {(^n^t)}.  Each 
eranent  of  this- set  is  referred  to  is 'a  cell-cluster  of  n-J)arti- 
cles  and  topological    configuration  t.    In  considering  the  pair- 
wise  interaction  of  n-particle$ ^ithin  a  cluster,  the  xertaining 


(N  -  n)  particles  while  constrained "to  their  lattice  sites  de- 

limit  the  accessible  configurations  of  the  particles  \<ithin  the 

cluster^.    In  order  to  ascertain,  the  contribution  of  a  given  , 

cluster  (of  rigid  disks  pf  diameter  a,  for  exarrtple)  to  the  free 

energy  it  is  necessary  to  evaluate  the  canonical  corif igurational 

partition  function,  which  may  be  expressed  as: 

0  -r 

r)  KJ  1=1 

_        tOn^t  i  _ 

\ 

Ay'- 

t 

where  R.  .  is  the  distance  b^ween  an  (ij)  pair' and  the  integra- 

t ion  is  over  a  specific  convex  region,    lit      ,  of  configuration 

n,  t 

T 

space  having  exact  content    (P      .    A(x)  is  the  unit  step 

n,  t 

function.    The  contribution  to  the  Helmholtz  free  energy,  Aj^, 
of  a  specific  cell  cluster  (n,t)  is  then: 
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where  A  =  h(2TnDk„T)"^''^. 


'  The  single-particl^  conf igurational  partition  function 

for  an  arbitrary  lattice  of  2-spheres  (disks)  is  a  curvilinear 

irregular  polytope       ^  of  content    P ^.    In  the  close-packed 

limit,  V  — Vq,  the  curvilinear  bounds  may  be  replaced  by  their 

tangent  hyperplanes  such  that  foj  regular  lattices 

lim  ,  and     lim  For  irregular 

V  — ^  Vo  ^         V  — ^  Vo  ^ 

lattices .  however , 

fRi  ana         are  bounds^  to  R  ^  and 

Similarly  for  higher-order  linearized  polytopes,  ,  the 

n,  t 

content,  fP    ^,  is  a  bound  to  an  exact     iP  ^.    For  singlet 

n,t  ^  ^    n,t  i  "  » 

and  pair  disk  configurations  and  singlet  sphere  configurations 
it  is  possible  to  calculate  the  contends       ^  ^  without  computer 
assistance  although  with  considerable  labor.    However,  in  order 

to  carry  the  cell-cluster  scheme  to  higher  order  the  multiplicity 

<\  > 

and  inordinate  complexity  of  the  polytopes  precludes  a  determi- 


.nation  of  their  content  were  it  not  for  the  polytope  bound 
algoritlun  derived  by  the  late  Professor  Z.  W,  Salsburg,     ^  - 
The  Salsburg  algorithm  has  a  long  and  rich  history. 
The  **homemade**  Rice  University  Computer  was  designed  in  part  in 
order  to  be  able  to  carry  out  such  classical  m^ny-body *integra- 
tions.    The  pioneering  dynamic  storage  allocation  feature  of  the 
Rice  Computer  enabled  such  integrations  involving  millions  of 
pair  configurations  for  small  N  to  be  effecte<f  with  a  relatively 
small  bulk  memory.    Due  to  the  vintage  of  the  electronic 

•"circuitry  (vacuum  tubes)  the  running  times  for  such  computations 
were  enormous  in  comparison  to  modem  standards  (70%  of  a  168 
hr.  week  was  n^t  uncommon  for  certain  configurations). 

The  polytope-bqjind  integration  algorithm  went  through 
several  stages  of  evolution  culminating  with  a  very  efficient 
exponential  polynomial  scheme  [4] .    We  wish  to  emphasize  that 
the  algorithm  effects  an  analytic  integration  and  not  a  numerical 

.  A- Jo 


integration;   This  is  pcfssible  because  of  th^'nature  of*  the 
boundary  li^iearization  giving  rise  to  step  function  bounds  the 

.  '    *  '  '  /  '         ^  * 

arrgu^ient;s  of  which  aire  polyTtomials. 
'  '';  WV  have  ;tj^y^ loped  the  SalsbuTg  ex/ohential  .polyixomial 

^Igotathi^,  iii^f art.ran -y  f Hqs.v  (Earlier 

versions;  w?'re  ,in  .•the;.Gen^  a^''^  ,  -jj^^ 

■■sqh^.ine  his  lje^'fi  g"en^r^^^  order  iq  t|e  ab^le  'to  ascertain  the 

^>robleins  pf  ^{Merest  ,tQ  us  :,(imperfe\^t/' irr'egulto  lattices)  .  > 
•The  alg<i-r-it]in  r.equij-es/the  .pred«;termlnati6n  of  thr^e ".quantities: 

-  •       ^;  ■  •  ■  ■       ■  .  ' 

.!.)■  th^  dimensionality the  integral  (N)  for'Hr-'.    2 A  The 
; nuBber. "of  bounds /CK')'of/th   polytopy  under  •eonsideratat)n.  ,  3.) 
The  equations  of. the' polytope  bounds  a!S.  ^ 
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2.    The  Salsburg  Algordthm 
We  shall  be  copcerried  with  the  integral  orepresentatipn 
,crf  the  classical  canonical,  partition  function  for  hard-sphere  - 
^y^teras.    We  £iJ)llow  the  development  of  Z,  Salsburg,  et  al«  for 

v.,  »  " 

..J        *  •  *  *  -  * 

t^gu.iar  lattices'       and.  generalize  the  prescription  to  accomo- 
y^^e  irregular^  impetfect  iattipe  .configurations.  .The  canonical 
[^"^ti^ion  fulKitibn  fotr.  ^reLy-rfipulsive  systems  may  be  expressed 


AST.  . , 


whfer6  A(x)  is  the  itnit  'step  function  which  restricts  the  center 
of  a  particle  to 'the  int^ior  of  a  pblytope  (Voronoi  region)  * 
defined  in  terms  of  its  bounding  planes  which  are  the  argvunents 
of  the  s^tep  function;    i.ci.  .  * 


^  *      *^  C  Q 
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AOk)  =  {  ^  -  (2,1b) 

0  x<0 


N  • 
if  ^  =  E  a  (i)x    +  a^(i)  ^.  ^  (2.2) 


X     =    tXj^,  .  .  .  ,  Xj^J 


N 

P(x  )  is  a  polynomial  in  the  N  integration  variables.    The  a^  (i) 
are  vectors  (j=l,  N)  of  constants, . i=l,  k,    Eqs,  (2.1b)' 

and  (2.2)  thus  define  the.  boun4ing  hyperplanes  enclosing  an  N- 
dimensional  convex  polytope  ^  which  is  the  dmiain  of  integration. 


The  exponential  polynomial  algorithm  is  based  on  the - 

N 

theorem  of  residues.    Pf  we  let  the  polynoijj^ial  P(x  )  =»  1  in 


Eq.  (2.1a)  we  may  write 


I=/-,/d5"nA(LW)  (2.3) 
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or 


-^^J-'J d^'e^^'-JS^  Y(Lf 5)  (2.4) 


where  the  exponential  function  e^  — ►  1  as  S  — *^  0  and  Eq.  (2.4) 

N 

reduces  to  Eq.  (2,3) •    ^    is  an  N  vector  of  arbitrary,  nonzero 


constlants  and  »S  is  a  real  number. 

As  in  Ref .  4  we  note  that  after  integraw^g  over  the 

/  . .  •  • 

first  i  variables  we  have^  ^ 

\  •  ■  . 

"  - ■  ,  -  ^  :  ■  '■ 

A  single  term  of  this  sum  (the  pth)  has  th^form 


Now  in  Drder  to  integrate  such  a  term  by  parts  over 
define 


and 


dv  =  e^^-'^-dx.  . 


Also,  iet' 


X. 


i+1  "  a 

m 


'^^  \  k=i+2  / 


4> 


K 


t 


is  bounded 


*  Hence, 


r 


/ 

0 


7 


Integrating  over  x^^^^. 


4 


K 

in=l  1 
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{x/\x\,  x^O 
where  sgn'(x)"  =  < 

f    0  x=6 


The  N-dimensional  infegral  results  from  an  integration  over  each 

each  of  which  leaves  a  set  of  N-i-1  dimensional  integrals. 
On  integrating  over  N  variables  the  result  is 


*>  '^--^  c*e 

I  =  lim   /  .—r-,  1  •  C2.7) 

S-»-0    -    "  ^ 


3=1\- 


Applying  the  residue  theorem  to  the  limit  gives  the  desired 


analytic  representation  ^o£  the  canonical  partition  function 


where  the  q^^^  and  c^  are  constants  determined  by  the  integration 
#  * 

procedure. 


5>    Illustration  of  Algorithm;;  The  Niggli  Lattice 

To  illi^tl^ate  the  functional  form  of  this  rather  obscure 
final  result  let  us  consider  the  single  paiXicle  configuration 
space  bound  [R    for  the  Niggli  Lattice  illustrated  in  Fig.  1* 
It  is  necessary  to  specify  the  dimensionality  of  the  integral 

V 

(N  =  2),  the  number  of  bounding  planes  t-o  |R .  (K  =  5),  and 

the  equations  of  the  polytope  bpunds,  iff^,  i  ^  1,...,  5:- 

6  =  1  +  X,        ■  .  •" 

1  » -  •  . 

0  =  1  - 


0  =  1  -  X2  (2.9) 


0  =  1  +  x^  -  X2 


0=1-  .182x^  +  1.19X2 

N  N 

'jc  is  then  an  arbitrary  N-dimensional  function  which  we 
choose  to  be  « 


N    N     ,  e  /.  e  N 

=  ^  •  ^1  *  2:5  •  ^2  *  •••  ' 
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where  e  is  the  Napierian  base.    Thus,  Eq.  (2.3)  becomes  .  * 

/ 

Urn    /  /dx.dx^e  ^  ^^^'^  ""^ACl+xjACl-xJACl-x^)  T 

0  A(l+x^-X2)ACl-. 182x^+1.19x2)  (2.10) 


We  proceed  by  integrating  Eq;  (2.10)  over  the  variable  X2. 
^     Consider  the  five  Heaviside  functions  representing  the  bounding 
polytopes  in  order.    If  X2  does  not  appear  explicitly  in  the      '  ^ 
expression  of  the  bound  (consider  Bounds  1  and  2  of  (2.9)),  then 
the  conteh^-of  that  bound  is  zero,  (Eq.  2.1a),  and  we  proceed 
to  the  next  bound. 

Consider  bound  3  at  tq.  (2.9).    Solve  that  bound  for 


^2 


.        =  1 


Eq.  (2.6)  states  that  the  contribution  of  this  bound 


<  • 
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to 'the  integral  is  given  by- 


I  =  -  lim   -llii  /dx.e  ^  ^  ^•^'^ACl+xJACl-x JA(0)A(x  ) 


0  A(2.19-.182x^)  +...  C2.ll) 


where  the  (-1)  denotes  the  sign  of  the  coefficient  of  X2  in  the 

boun4  3)  ^of  Eq.  \(2.9),  j-g-  denotes  the  coefficient  of  X2  m 

e  '  * 

'  N    N     ^^^1^2  5^  N  N 

a  •x  ,  e  •     results  from  substituting  X2=l  into  a  •x  , 

and  the  Heaviside  functions  result  from  substituting  X2=l  into 

the^previous  Heaviside  functions>  of  Eq,  (2,10),  and  the  +(•••) 

denotes  that  there  will  be  similar  contributions  from  each  of 

the  other  polytope  bounds  of  Eq,  (2,9).    It 'is  worth  noting 

that  for  some  bounds  (5  and  6)  x^  can  not  be  expressed  as  a 

constant  bi*t  rather  as  a  function  ot  the  other  variables 

(x^  =  a  +  bXj^).    Our  procedure  remains  the  same  in  this  case 
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as  we  substitute  this  Wpre^ssion  for       into  and  the 

Heaviside  funct  ions ♦   "  '  \ 


The'' Heaviside  functions  of  Eq,  (2.11)  are  redundant. 

Since  there  remains  only  an  integration  over  one  variable  x^^, 

N 

can  have  only  two  bounds,  upper  and  lower  bounds.  We 
eliminate  the  extraneous  bounds.    A(0)  may  be  eliminated  since 
A(c)  =:  1  where  c  is  a  constant  (Eq.  2.1a).  ,  Fig.  2  illustrates 
our  considerations  in  eliminating  the  two  other  extraneous 
bounds.    We  wish  to  consider  only  the  area" of  the  irregular 
pentagon  of  Fig.  2.    The  bound  A(2.19  -  .182Xj.)  is  defined^by 
the  intersection  of  bounds  1)  and  5)  of  Eq.  (2.9),  thus  extending 
beyond  the  pentagonal  polytope  and  it  is  eliminated.    We  are 
considering  the  area  enclosed  by  bound  3).    This  bound  extends 
from       =  +1  to  x^  =  *0.    Since  A(l  +  x^)  defines  a  bound  (x^  = 

y 

-1)  outside  this  region^N  it  must  also  be  an  extraneous  bound 
^and  is  eliminated*    We  are  then  left  with  ACl  -  x^)A(+x^) 
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as  our  limiting  bounds  on  x^^ 

We  have  presented  a  strictly  qualitative  argument 
for  the  elimination  of  extraneous  bounjis.    For  a  mathematical 
algorithm  it  is  necessary  to  develop  a  quantitative  procedure 
for  eliminating  such  bounds.    This  may  be  accomplished  by 
noting  that  the  two  nonextraneous  bounds,  form  the  smallest 
region  of  integration  for  x^,  (from  -1  to  0).    To  eliminate 
extraneous  bounds  consider  the  Heavisi4e  functions  of  Eq.  (2.11), 
These  are  of  the  form  A(a  +ybx^)  where  ^a  and  b  are  constaivts. 
,  If  b  =  0,  eliminate  the  Heaviside  function. 

If  b  >*  0,  eliminate  all  bounds 'with  b  >  0  except  that 
one  with  the  smallest  value  of  a.    Call  it  lower  bound. 

If  b  <  0,  eliminate  all  bounds  witji  b  <  0  except  that 
-  one  with  the  smallest  value  of  a.    Call  it  upper  bound. 
^,  There  will  remain  two  bounds,  one  for  b  >  0  and  one  for 
b  <  0  as  desired.    Ift*>all  cases  where  N  <  2,  this  procedure 


•assures  th^t  the  integration  will  be  over  the  proper  region. 
Occasionally  extraneous  bounds  arise  due  to  pdir  correlations 
between  two  or  more  particles  which  assume  the  form 


♦where  the  hatched  area  represents  the  area  of  integration. 


Note  that  the  area  between  the  two  bounds^  in  which  we  are 


interested,  is  excluded  and  hence  this  configuration  represent 


an  extraneous  set  of  bounds  which  must  be  eliminated.  For 


such  a  configuration,  the  lower  bound  we  hav.e*  c^culated  will 
be  greater  than  the  upper  bound  we  have  formed.    By  testing 


for  this  property  we  may 'eliminate  all  sets  of  two  bounds. 


*  both  upper  and  lower  bounds,  where  lower  bound  >  upper  bound. 
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With  these  ^considerations  Eq.^(2.11)  may  be  written  as 


I  = 


im  — —    I  dx 


1 


(2.12) 


We  may  now  proceed  to  integrate  Eq.  (2.12)  over  the 


Variable       in  the  same  manner  as  we  integrated  Eq.  (2.10)  over 


X2.  Then; 


I  =  Urn 


S 


2.5  L 


@  A(1)A(0) 


+  ... 


(2.13) 


where  the  quantity  in§ide  the  first*  set  of  C  )  is  obtained 
frpm  considering  the  bound  A(l-x^)  of  Eq,  (2,12),  +1  is  (-) 
the  signjof  jthe  constant  multiplying ,>^Cj  in  that  He^viside 
^functioji.  '  S(l)  is  the  constant  multiplying  Xj  in  jji  -jc 


in  Eq.^  (2.12).  '  s(l  +  ytX^^^^^"^  '  ^^'^  ^^^^  result  frop ' solving 
A(l  -  Xj)  for       =  -1  and  substituting  this  value  into  the 
exponential  and  Heaviside  functions  of  Eq.  (2.12).    The  quantities 


inside  the  second  set  of  (  )  are  obtained  iii  like  fashion  for  .' 


A(+x)  at  Eq.  (2:12).    if  we  note  that  A(01  and  A(l)  -  i,  Eq. 


(2.13)  is  of  the  form 


I  =  lim 


S->0  S 


2% 
2.5 


] 


j+  ... 


C2.14) 


,  We  similarly  obtain  the  results  of.  integrating  over 

the  other  bounds.    These  are 


1.)  ZZZ>  0 


2.) 


4.) 


■ ; 


zts       *  275 


r  -S    ^  275] 

L-e    ^e    _     J  . 


5.) 


^2  _c_ 
^  2.5 


153 


\  3ii 
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Eq.  (2*14)  is  now  repjesented  as  the  sum  of  1  thru  5),  T&king 
the  limit  as  S^-^O,  according  to'^tKe  residue  theorem,  we  obtain 


ri'+ ci^  [if        c  . 

c  c[l  +  cj      1  +  c  ' 


.c[l  +  .153c]  ^  c(l  +  •ISSc) 


where  c     j^-*    Th®^  \  -  3.19rf.  ,     '  V 

We  have  adopted  this  algorithm  to  the  Univac  1108 
computer  using  Fortran  V.    A  calculation  which  would. take  oft  the 
order  of  a  month  by  hand  and  twenty  minutes  of  computer  time 
for  a  numical  integration  takes  less  than  half  a  minute.  In 

4 

cdi^ast  to  the  original  Salsburg  algorithm  written  in  Algol, 
our  Fortran  version  may  be  used  to  determine  the  conf igurational 
partition  functions  o£  deformed  and  disordered  lattices.  The 
Fortran  version  has  enabled  us  to  model  cooperative  lattice 
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phenomena  within  the  Cell-cluster  expansion  .formalism.  For., 
such  models  we  cannot  overemphasize  the  utiiity  of  the  Salsburg 
algorithm  and  this  generalized  veWion  of  ,it. 
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HIGH   DENSITY  APPROXIMATION   TO  PAIR 
SUBFIGURES  OF  NIGGLI  .PACKING,  I  ■  < 


FIGURE  1 
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 ."Free  Area^' 

J.  Extraneous  Bounds 


NIGGLI  I  "FREE  AREA"  and 
"EXTRANEOUS  BOUNDS" 
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Program:  'SYMMETRY' 

Author:         Dr*  Ronald  A.  Wohl 
School  of  Chemistry 
Rutgers  University 
New  Bninswick,  .N-J-^089C5 


Abstract:     A  computer  program  module  is  described  which 
determines  the  molecular  point  group  of  a  molecule  and  prints  the 
corresponding  character  table*    The  gr.oup  character  table  is  used 
in  a  form  suitable  for  subsequent  use  by  other  program,  modules 
which  are  able  to  perform  a  coii5)lete  symmetry  analysis  of  the 
molecule,  i.e.  they  can'e/g.  determine  the  atomic  representaCti on, 
the  structure  of  the  representation  (irreducible  represeni^atiohs), 
sets  of  equivalent  atoms,  atomic  orbitals  or  internal  coordinates 
and  set  up  symmetry  coordinates  or  symmetry  orbitals.  These 
modules  may  be  used  in  a  wide  variety  Qf  programs  for  vibrational 
analysis,  molecular  orbital  theory  and' electronic  spectroscopy, 
crystal  field  theory  and  crystal  structure  determination. 


The  concepts  of  sviranetry  akd  group  theory  have  found  widespread 
use  in  chemistry^"?.    Thus. the^ molecular  point  groups  are  used  In 
stereochemical  arguments^^  (chiralityj  chemical  equi^ralence  oi 
atoms  and  bonds;  presence  or  absence  of  a.dlpole  moment),  vll^a- 
tional  analysis^)  (symmetry  species;  degeneracy  and  activity  of 
fundamentals,  overtones,  combination  bandsj  .selection  rules; 
;5ymmetry  coordinates;  polarizatioH\of  Raman  bands),  moleculsr 
orbital  theory  and  electronic  spectroscopy ^^"^z  (Species  and 
degeneracy  of  molecular  orbitals,  selection  rules,  vibronic 
transitions)  and  crystal  field  theory^)^)  (>crystal  field  splitting; 
correlation  diagrams;  spectral  and  magnetic  properties).  The' 
extension  from  point  groups  to  lattice  groups  includes  applications 
in  the  vibrational  analysis  of  crystals. (lattice  modfes;  selegtion  ■ 
rules;  sit.e  symmetry  and  factor  group  ianalysis),  other  pt^sical 
properties  of  ci^ystals^)  and  the  determination  of  the  space  group 
ana  bryst.al  class  itself. 
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Although  countless  computer  programs  in  the  above  subject  fields 
have  been  written  very  few  programs  so  far  have  made  use  of  symmetry 
at  all,  and  no  program,  to  the  best  of  our  knowledge,^  has  handled 
the  entire  symmetry  analysis  completely  by  itsel,f»    In  those  few   ;  ^ 
programs  making  limited  use  of  syinmetry,  the  symmetry  information 
has  usually  to  be  "set  up"    by  hand  and  entered  as  ^data  specific 
for  the  problem  at  hand.  (E.g.    The  U-matrix  in  vibrational 
analysis  for  converting  internal  coordinates  to  symnetry  coordinates; 
or,  the  sets  of  equivalent  nuclei  in  NMR 'Spectra  simulation  programs). 
The  disadjrantages  of  this  procedure  are  obvious. 

It  is  fo^^  this  reason,  that  we  have  writt/en  a  program  module,  that 
completely  automatically  wiT:hout  any  input  other  than  the  masses 
and. coordinates  of  the  atoms  determines  the  molecular  point  gtoup 
of  a  molecule  and  prints  the  corresponditig  character  table.  The 
group  character  table  is  used  in  a  form  suitable  for  subsequent 
use  by  other  program  modules,  which  can  perform  a  complete  symmetry 
analysis,  such  as  determine  the  atomic  representation,  the  structure 
of  the  representation,  the  sets  of  equivalent  atoms  and  internal 
coordinates,  and  set  up  symmetry  coordinates  and  symmetry  molecular 
*  orbitals.    The  extension  to  the  symmetry  analysis  of  crystal 
lattices  is  easily  feasible  and  planned  for.  the  near  future.  The 
three  most  important  subroutines  of  this  program  module  are  briefly 
discussed  below. 

Subroutine  PMTGR 

This  subroutine?  is  the  central  part  of  thfe  program;  it  determines 
all  ^molecular  point  groups  up  to  order  8  .of  the  main  axis  of 
symmetry  ^dth  the  exception  of  those  point  groups  giving  rise  to 
spherical  topa,  which  are  hstndled  by  the  subrouti^ne  SPHERE. 

A  siiT^lified  £low-chart  of  this  subroutifie  is  shown  in  Fig.  1.^ 
This  flow-chart  is  based  upon  the  distribution  of  the  various  types 
of  symme4:ry  elements  in  the  various  classes  of  point  groups,  v/hich 
are  summarized  in  lable  1.    Similar  procedures  and  flow-charts  'for 
the  ^systematic  ident'ification  of  the  point  group  by  visual 
inspection  of  a  molecular  model  have  been  published*^/.    The  appli- 
cation to  a  computer  program  required  some  modifications.  A 
particular  problem  not  present  in  the  visual  inspection  is  the 
specific  orientation  and  reorientation  of  the  molecule  in  the 
Cartesian  coordinate  system  required  during  the  computer  evaluation. 

The  presence  of  the  various  possible  symmetry  elements  is  tested  for 
'  by  applying  the  corresponding  transformation  matrices  to  the 
molecule  and  checking,  whetl)er  the  transformation  corresponds  to  a 
Symmetry  operation.    The  mathematics  required  for  this  part*  are  well 
known  and  can  be  found  in  all  books  dealing  with  molecular 
symmetry 5 . 

Jfrspecial  points  may  be  mentioned. 
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Table  1:-  Distribution  of  the  symmetry  elements  in  the  Various 
types  of  molecular  point  groups. 
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=  otder  of  symmetry  element  C  ,  i.e.  order  of  mai^  rotatory  • 

axis  (=Z-axis)  •  -    ^  ^ 

=  horizontal  plane;  d  =  v^r-feical  plane;  Cp'  '=  perpendiciaar 

2-fold  asMLs;  +  means  symmetry  element  is  present;  -  means 

synmetry  element  is  ahserit. 


a)      The  symmetry  plane  is  equivalent  to  Si.    See  also  h). 

h)     The  molecule  possesses  one  plane -of  symmetry.    Normally  this 
plane  cannot  he  designated  as  a  horizontal  or  vertical  plane. 
During  the  point  group  evaluation,  the  molecule  can  have, 
however,  its  symmetry,  plane  oriented  either  'Tiorizontally"  , 
(i.e.  in  the  xy-plane)or  'Vertically*'    (perpendicular  to  the 
xy-plane,  i.e.  through  Z-axis). 

c)  Includes       =  molecule  contains  no  symmetry  element  except 
the  identity. 

d)  .    This  symmetry  element  is  inpl^ed  by  the  remaining  symmetry 

elements,  but  is^  riot  tested  for  dioring  the  computattLon. 

e)  it\cludes  S       C  . . 


■  C13 
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Fig.  1  »  FLOl-fCHART  OF  SUBROUTINE  PNTGR 


9 


CALL  MOMIN 


YES 


CALL  SPHERE 


EETERMINE  IZ,  lY,  IX 


P.G.=  C  t 

ooh  1 

i 

YES 

IHPL=0' 


IVPL=0 


TY=I>v   NO  sj 


ROTATE  MOLECULE 
SO  THAT  IZ^IY, 
AND  IZ^IX. 


YES>^^IZ= 


s007 


,JES_^ 

P.O.  =  D 

oov 

IHPL=1 


NO 


IVPL=1 


iz=r 


.  YES  ^ 

P.O.  =  c 

—  s 

7^ 


YES 


NO 


ROTATE  V.PL. 
INTO  XZ-PUNE 


ERIC 


3^0 


5-66 


i. 


(CONnmJATION  OF  FIG*  li  FLCWCH/tRT  OF  SUBROUnKE  PNTGR 


V   NO--  X, 

ROTATE  P.2A. 

i 

INTO  XZ-PLANE 

1 

N 

f 

P.G.  e  D 

n 

YES 


NO 

^.  > 

^P.G.  =  C 


n 


LEGENDt  ^  .     :        .  < 

IZ'     =  Order  of  rotation  axis  collinear  wilii  the  z-uxis 

lY         r:      •»       «  H  II  tl  II     ^    It  y^is 

IX      t=     »     "        "         "  "  \    II «  X-axis 

KIPL   tz  IndeXf  lAether  horizontal  plane  of  symmetry  is  present 

(t=  1)  or  not  0)t 
IVPL   B  Indexf  ^Aether  vertical  plan©  of  symmetry  is  present  ' 

(=  1)  or  not  (=0).  '     '  - 

IP2A   =s  Indexi  ^rtiether  perpendicular  2-fold  axis  is  present 

(i=  1)  or  not  (=  0). 
H»PL»  =  Horizontal  plane  of  symmetry 
VtPLt's  Vertical  plane  of  symmetry 
Pt2A.  =:  Perpendicular  2rfold  axis 
PtG»'  =  Point  group 
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1.  The  molecule  Us  initially  oriented  after  its  principal  moments 
of  inertia  by  the  subrcaitine^  MCKEN.    Symmetric  top  molecules  are 
orietited  such  that  the  main  axis  of  symmetry  is  collinear  with  the 
Z-axis.    '   '  .     >*  . 

2.  The  order  of  the  rotation  axis  collinear  with  the  Z-axis  is 
detenninedjby  rotating  tlje  molecule  around  the  Z-axis  by  the  angle 
^TT/n,  where  n  starts  with  8  and  decreases  to  2.    Thus  to  find  the 
correct  carder  of  the  main. symmetry  axis  it  is  necessary  to  test 
for  the  highest  possible  order  first. 

J.    Symme.tr ic  top  inolecules  present  a  special  problem  similar  to 
that  posed  by  spherical  tops  (see  belovf)  although  of  less  complexity. 
Namely  the  mblocule  cannot  be  preorien^e'd  in  the  xy-plane 
because  the  correiSponding  two  principal  moments  of  inertia  are 
equal.    Thus  the  orientation  of  possible  ^vertical  planes  of  symmetry 
and perpendicular  2- fold  axes. .of  rotation  cannot  be  known  a  priori. 
Therefore  all  pessible  pairs  of  atoms  are- tested  for  being  both 
suitable  -and  symmetry  equivalent.    Suitable  in  this  context  means, 
that  both  atoms  have  the  same  atomic  weight  and  the  same  Z  coordinate 
'Symmetry  equivalent  means  'that  i  transformation,  metrix  can  be 
found,  which  transforms  atom  A  into  atom  B  and  simultaneously  is 
also  a  symmetry  operation  for  the  molecule-  as  q.' whole. 

\.    After  the  point  group  has  been  detei^mined,  the  ijiolecule  \d.ll  be 
reoriented  in        Caijtesian  coordinate  system,  if  necessary,  x  to 
follow  all  the  re6ommendations  given  in  the  Mulliken  repo'rt^^ . 

Subroutine  SPHERE  "     *  * 

This  subroutine  determines  the  point  group ^of  all  spherical  tops, 
i.e.  the  point  groups  related       the  geometrically  regular  solids:  ^ 
tetrahedral:    T,  T,,  TJ^ -octahedral':  0,  0^  and  icosahedral:  I, 
(The  subroutine  also  evaluates  thef'rare  case  of  accidentally 
degene^rate  spherical  tops)..  ^^        .  \ 

The  complete  procedure  for  the  determination  of  the  spherical  point 
groups  is  rather  complex.    Spherical  top  molecule's  cannot  bfe 
oriented  easily  after  a  main  axis' of  symmetry,  since  the  5  principal 
moments  of  inertiST  are  equal.    Therefore  no  symmetry  elements' 
oriented  a  priori  in  tha  Cartesian  coordinate  system  can  be' used  f<3p 
testing  the  molecule.    (Neither  must  any  atom  of  the.  molecule  lie 
on  a  symmetry  element)^.    The  rather  complex  procedure  necessary. 
1/hen  is  the  following:    All  possible  petirs  of  atoms  of  the  moldcule 
are  tested  for  being  suitable  and  for  being  symmetry  equivalent.  A 
pair  of  atpms  is  suitable  if  both  atoms  have  the  same  atomic  weight 
and  the*  same  distance  *from  the  center  of  mass.    The  two  atoms  are 
then  symmebjjy^equivaient  if  a  transformation  matrix  can  be  found,  '  ^ 
Vihich  tranJ^s^ias  atom  A  into  ^tom  B,  while  being-  at  t^ie  same  time  . 
a,  symmetry  ^^l^ration  for  the  molecule  as  a  whole  .    Given  the  two 
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{)oiivts  A  and  B  and  the  order  of  the  sought  rotation  axis,  the 
orientation  of  the  axis  (of  which  there  are  2)  can  be  calcxilated* 
by  analytical  geometry*    The  corresponding  trans formation ^matrix  can 
then  be  set  up  and  be  tested  foj:  being  a  symmetry  operation. 

Fig.  2  illustrates  tbe  geometric  construction  we  h*av6  .used  in  order 
to  calculate  the  orientation  of  a  possible  rptation  axis  of  ord^r 
n,  which  transforms  atom  A  into  a.to^  B.  > 

Fig>  2;    Location  of  points  P  and  P;  which  define    rotation  axis  of 
order  n  to  transform  ]^oint  A  into  poirjt  B. 


OP  defines  rotation  axis 
V      C  't:  midpoint  Jsetween  A  iLnd  B 

n   rr  order  of  rotation  axis 
Thent    OPX^  PC 
CAJ.CP 

->y  .    (Similarly   or  P»  in  place  of 


P) 


Subroutine  CPIRTBL  ,  .  '  ^ 

This  subroutine  prints  the  complete  characifer  table  of  the  molecul^ 
point  group  determined  by  the  subroutine 'POTGR.  '  ^     "  | 

In  addition  the  subroutine  ,performs  a  lot  of  "invisible  bookkeeping", 
namely  it  sets  up  all- arrays  and  in^i^oes  for  later- >use"  of  the 
character  table "  '  , 

The  handling  of  tliis  character  table  required  by  the  program  is  a  ~ 
rather  involved  and  lengthy  procedure.  .  Basically  the  character 
table  is  stored  in^  the  subroutine  and  used  by  the  program  in  a 
^'compressed"  form  similar  to  the  printed  form,  i.e,' similar  to  the 
form  usually  found  in  textbooks^).-  A  subsequent  program  module, 
which  will  be  discussed  briefly is  then  capable  of  "reading"  eacK 
symmetry  class  symbol        s4tting  up  the.  symmetry  transformation 
matrices-  for  all  the  symmetry  operations  of- this  symmetry  class. 
With  the  transformation  matrices '•for  the  entire*point  group  :|.t  is 
then  conveniently  possible, to  carry  out  any  desired  further 
symmetry  analysis  of  a  molecule,  such  as  to  determine  sets  of 
equivfilent  atoms,  internal  coordinates,  atomic  orbitals  ancl  to  set 
up  symmetry  coordinates  '^r  symmetry  orbitals. 


other  subroutines  iircluded  in  the  program  module  "  SYMMETRY"  • 

TEST  PNTGR:  '  The  main  program  vftiich  handles  input  and  output  and 
c;^T1,s  .HJTGR.  and  CHSTBL.  .      /  '       .  • 

Determines  the  principal  momexvi^  of  inertia. 
.JCDIAG  :    ftilagonalizes  a  symmetric  matrix^b€4ijg^he  Jacobi  procedxire, 
^ ,  Needed  by  MCMIN . .  . 

ORIWC*I;    Determines/^ whether  a  molecule  is  a  asymmetric,  symmetric 
'    or  -spherical  'top/'fwftether  it  is  line'ar  or  planar  or 
,  nonplaneo:.  ' 
ROTATE:    Rotat^V'  a  ^glecule  by  fic^plication  of  the  transformation 

•  matrix.  ^     *  ,     '  ■ 

ROTAX.  :.  -Rotates  » a  mplectCle  around  one  of  the  coordinate  axes. 
ROTPMP:    Calctilates  the  Cartesian  coordinates  of  the  2  points  which 

defiiie  possible  axes  of  rotation^  for  a  nonoriented  spherical 
top  mdflecule.  "    [  .  *  '  '  ' 

ROIMAT;    Peteridnes  the  desired 'transformg,tion  matrix^  given  the 

poiAt(^)  .determined  by  PNTROT  ^d  the  possible  order  n  of 
.      \the  symsnd try  . axis  ^ 
TEST    :    Tests,  whether  a  transformation  matrix  corresponds* to  a 
symmetry  operation  for  .the  molecule  xind^r  consideration - 
PLTCRT:    Plots' the  molecule;    Fpr  this  a  connectivity  table  of  *the,  . 

molecule  liiust  be  entered.         .      '  *  \  -  \  ' 

BATE    :    Writes  the  date  of  the*-day*   *  '   .  - 
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The  Computer  and  Its  Role  in  the  Development 
of  the  Undergraduate  Chemistry  Curriculum 

;         ^  Joseph  R.  Denk 

Curriculum  Development  Manager 

North  Carolina  Educational  Computing  Service 

P.  0.  Box' 12175 

'Research  Triangle^^Pajrk,  North  Carolina  27709 

Undergraduate  chemistry  provides  a  strange  spectrum  of 
^computer  activity  when  one  considers  that  our  quantitative  science 
surely  should  provide  the  fitting  environment  to  at  least  the  pro- 
blem-solving mode  afforded  by  a  computet.    The  spectrum  of  curric- 
u!tum  involvement  runs  from  no  usage  in' large  universitiA  as  well 
as  small  colleges  to  almost  total  permeation  as  at  the  University 
of  Texas  in  Austin, and  at  St.  Andrews  Presbyterian  .College  in 
Laurinburg,  North  Carolina.    What  is  strange  is  that  the  spectrum 
does  not  resolve  itself  simply  into  the  problem  of  the  "havjs^s"  ' 
and  the  "have  not*s"  with  respect"  to  availability.    ,Some  Univer- 
sities with  graduate  schools  loaded  with  computer  power  haven *t  a 
single  undergraduate  course  in  chemistry  involving  the  computer. 
Several  small  colleges  with  a  single  teletype  or  with  a  programmable 
calculator  have  curricula  for  which  the  computer  is  essential. 
What  is  not  strange  is  the  fact  that  the  computer  is  still  an  infant 
in  education  and  curriculum  development.    Even  more  clear  are  the 
financial  barriers  to  usage.    Strange  and  also  disturbing  ar^  two 
opposing  factors:    the  departmental  non-financial  barriers  to .usage 
of  the  computer  in  education  and  the  apparently  forced  interest 
in  getting  gtat^  money  for  computer  curriculum  development  ks  a 
result  o£  cut-backs  in  research  fund%  (NSF  reports  200  proposals 
a  month  far  computer-based  curriculum  development  in  chemistry).  . 

This  sesSyion  of  the  Conference  on  Computers  in  Chemical  Education 
is  dedicated  ik^exploring  the  frontiers  upon  which  decision-makers 
embark  when  they  feel  they  are  willing  to  Introduce  the  computer 
Into  their  chemistry  curriculum.    The  strange  spectrum  of  usage 


indicates  that  there  are  many  on  these  frontiers  and  .also  that 

many  are  searching  for  the  frontiers  themselves.    Further,  the 

doubts  as  to  the  validity  of  th^  use  of  the  tool  relative  to  Its 
I.  • 

cost^seem  very  probable  as  contributing  to  the  current  state  of 
the  art.    It  is  also  probable  that  the  modes  of  potential  computer 
usage  still  remain  unknown  to  many. 

*  I  see  the  major  front^iers  to  computer-based  chemistry  curric- 
ulum development  as  being  included  in  one  or  more  of  the  following 
five  general  questions:  ^ 

(1)  What  size  computer  will  best  serve  my  curriculum  needs?, 

(2)  Should  we  teach  computing  itself  in  the  chemistry , curriculum? 

(3)  What  computer  languages  are  best  for  chemistry  education? 

(4)  Should  students  write  all  of  the  programs  or  is  the  usage 

of  *'canned"  programs  valid  'for  curriculum  developmint? 

(5)  What  roles  can  the  computer  play  in  the  curriculum? 
Answers  to  the  last  four  question^  are  being  pursued  in  several 
efforts  both  with  large  and  small  computers.    The  teaching  of 
computing  and  the  testing  of  languages  are  going  on  in  both 
environments*    Programming  by  students  and  the  use  of  materials 
created  elsewhere  are  producing  a  healthy  conflict.    In  the  pro-r 
cess,^  curriculum  innovation  is  emerging  as  a  result  of  roles 
played  by  the  computer  wh'icTi  are  undreamed  of  by  toany. 

A  major  issue  has  emerged  on  these  frontiers  —  the  issue 
of  whether  or  not  curriculum  development  rests  only  on  the 
production  of  indigenous  materials.    For  those  teaching  computing 
itself,  doing  one^s  own  thing  in  pfogr^paming  is  usually  the  only 
/way.    For  the  vast  majority,  however,  the  hopes  lie  in  the  possibil 
ity  of  reaping  the  programming  results  of  others*    Several  centers 
for  exchange  of  computer-based  chemical  materials  are  ewtant  for 
this  majority.  . 

s 
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EIN  was  set. up  to  test  whether  or  not  programs  would  be 
by  r^ote  users  on  the  computer  vWhere  they  exist  with  I/O 
going "through  the  mail.    This  process  has  proven  ineffective. 
ARPA  l^as  attempted  to  tie-up,  by  hard-wire' connections,  several 
computers  so  that  access  would  be  "direct". .  A  successful  hard- 
wit^  graphics  network  exists  out  of  Santa  Barbara.    The  exchange 
of  materials  via  hard-wire  connection  is  coming  b'jt  ia  Loo  far  in  ^ 
the  future  and  too  expensive  for  the  majority  who  n6ed  materials 
now.    A  third  mode  of  exchange,  the  physical  transportation  of 
materials,  is  occuring  at  Eastern  Michigan  University  (CECCP)  , 

at  the  North  Carolina  Educational  Computing  Service  (PALS)  ,  at 

3 

Illinois  Institute  of  Technology  (COPES)  ,  among  several  such 
centers.    .This  mode  of  exchange  is  fraught  with  difficulties 
unimaginable  to  the  novice  but  light  is  being  shed  and  exchange 
Is  going  on.    EIN  reported  that  it  catalyzed  several  hundred 
exchange  efforts  even  though  it  was  not  set  up  to  do  this.  ^ 

The  papers  selected  for  thi'^  section  represent  depth  in. 
all  of  the  frontiers  mentioned  above.    The  first  invited  speaker, 
K.  Jeffrey  Johnson  of  tjie  University  of  Pittsburghf>  will  report 
on  almost  all  of  the  frontiers  involved  in  the  use  of  a  large 
computer:    teaching  programming,  creating  materials,  computer 
languages,  hard-wire  exchange,  and  the  sharing  of  nwteriais. 
For  the  choice  of  computer  size,  Johnson's  presentation  will  be 
.contrasted  with  that  of  Professor  Klopfenstein  of.  the  University 
of  Oregon  who  will  present  the  case  for  the  small  computer.  Pro- 


^Centcr.for  the  Exchange  of  Chemistry  Computer  Programs.    For  Infot 
mation  write:    Prof.  Ronald  Collins,  Eastern  Michigan  University, 
Ypsilanti,  Michigan  48197.  '  ,  , 

^Program  and  Literature  Sei^yice.    For  information  write  the  author 
of  this  pape^. 

^Cooperative  Program  Exchange  Service.    For'infomiatlon  writ^,:  « 
Ronald  Stiff,  Illinois  Institute  of  Technology,  Chlcagd,  Illinois 
60616.  ,  •  •  '  ' 


fcftaors  James  and  Koc\\  of  the  University  of  Northern  Colorado  ^ 
will  offer  contrasting  results  on' the  need  for  teaching  computing 
with  those  of  Professors  Klopfenstein  and  Jolmaon.    These  three 
papers  will  also  offer  conflict  on  the, decision  for  choosing 
programming  lan^;uages.  ,  ^ 

All  of  the  papers' will  substantively  touch  the  materials 
Involved  ^n  the  chemistry  curriculum.    Al  Matsen  of  the  University 
of  Teacias,  who  will  present  the  second  ihvited  paper,  will  bring 
tp^t  some  of  the. most  innoVative  roles  of  the  computer  In  chemistry 
education.         Warren  Smith  of  Earlham  College  will  add  further  ' 
innovation  in  experimental  design.    The  p(roblenis  ot  transportability 
will  become  evident  in  aH  of  these  papers  and  will  be  touched 

*  directly  by  Larry  Sherman  of  North  Carolina  A.  &  T  State  University 
and  K.  Jeffrey  Johnson!  ,  Contrast  in  opinion  on  whether  or  not  to 
use  "canned"  programs  will  be  evident,    A  spectrum  o^  materials 

^  will  be  presented  running  from  those  which  aftre  totally  non-trans- 
portable to  simply  movable  systems. 

It  is  hoped  that  these  papers  help  the  decision-maker  who 
wants  to  upgrade  his  curriculum.    To  the  attendee,  this  helpj 
should  come  in  choosing  computer  size  and  programming  languages. 
A  key  question  should  enter  the  attendees  mind  throughout:  ^an 
"  I  obtain  materials  to  get  started  or  should  we  go  from  scratch   ^  • 
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'  *   Curriculum  Development  Through  Coinputer-Basjed  Mat§rials- 

'Graphics,  Tutorials  and  Progranming  Languages"  ' 

K.  Jeffrey  Johnson     ^  - 
Department  of  Chemistry 
.  .*  University  of  Pittsb'urgh  , 

Pittsburgh,  Penna.  15^215 

Computing  techniques  are  being  used  in  a  number  of  -ways  to 
increase  the  effectiveness  of  chemical  education.    In  this, paper 
I  will  discuss  t^e  results  of  three  applications:  -an  interactive 
graphipcs- system,  tutorial-drill  CAI  programs' Btnd  teaching  computer 
programming.  '  '  ^ 

Graphics 


'  '      Pitt  was  one  of  the  Universities  in. the  NSF-sporisored 
interactive  graphic;s  network  centered  at  the  University  of  Calif-, 
omia,  Santa  Barbara.    The  software,  is  an  extension  of  the  Culler- 
Fried  system,  and  is  supported  by  an  IBM  560/75.    A*  bloc^  diagram 
of  .the  remote  tenninal  is  shown  below.  ^ 


Keyboard 


00000000000 

'^^8888888^' 

^OQOOQOQ 
OOOOOOOOOi 

888888' 

pooooo 


Write -only 
TTY 


Tha;^user  interacts  with  the  system  by  pressing -keys  on  the 
keyboard.    Eech  tiijie  a  H,ey  is  pressed  a  \)it  string  is  transmitted 
to  Santa  ^sirbara,  an  operation  is  performed,  and  trie  result 
returned  and  displayed  on  the  scopy.    If  a  permanent  copy  of  the 
p3;ogram,  nupieric  values  or  a  plot  is  desired,  the  output  can  be 
directed  tojtne  teletype  and/pr  plotter. 
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»    A'  number  of  profJirains  to  /imulate,  cheipical  systems  have 
"been  written.    Table  I  contains. *a  brief  description  of  these 
programs . 


Name 


NMR 


EXCHANGE 
CONTOUR 

RADIAL 
EQUIL 


HEAT 


KIN  (1) 

KIN  (2) 

DERIV 
EDTA 

NEENST 


Table  I.  ^  Grapftics  Programs 
Description 

Simulate  AB,  ABg,  ABX  and  A^Xg^  high- resolution. NMR 
spectra.     Parameters;    chemical  shifts,  coupling 
constants..  • 

Two  .site  NMR  exchange.     Parameters:    lifetime,  shift. 

Constant  y-^Y  contour  lines  drawn  for  the  ls,2p,5p  and 
5d  orbitals.    Parameters:    effective  nuclear  change^ 

'  max* 

Hydrogen  atom  "radiisCL  distribution  functions  (ls,2s,2p 
5p,5d).    Parameters:    quantum  numbers; 

"  Calculate  equilibrium  cqnstant/by  minimizing  total 
free  energy  function.    Outputs    total  free  energy  vs. 
extent  of  reaction.    Parameter^:    temperature,  stoich- 
iometry,  standard  chemical  potentials. 

Enthalpy  of  reaction.     Integrates  empirical  Cp  func- 
tion.   Output:    display  of  function,  integral. 
Parameters:    empirical  constants,  temperat^e  range, 
enthalpy  at  one  temperature. 

Kl  K2 

Simulates  A->  B->  .C.    Output:    A,B,C  vs.  time. 

Parameters  Kl,  Ka,"A  . 

J.  o 

A;!       C.    Parameters:    U  rate  constant^,  A^. 

;  '  -        ^  ' 

Determines  the  inflection  point  of  a  sigmoid  cuire. 

Simulate  M^^  -  EDTA  titration.    Parameters:    K«,  pH, 

,  Calculate  EMF  for  M|M^^  electrode  in  HgO-NH^  solution. 
Output:    EMF  vs..  pH,    Parameters:    Z,  to,  successive 
formation  constants 


r 


Table  I.  ^Graphics  Programs  (Continued) 

Description  ^ 

Calculate  solubility  cTlzrve  of  hpCp;  where  X  is  conju.- 
gate  base  of  HX.    Output:    Solubility  and  log  of 
solubility  vs.  pH»     Parameters:    Ksp,  Ka. 

Metal-ligand,  equilibria.  Output t  fractioi^of  each 
species  vs.  log  of  ligand  cone.  Parameters :Tflumber 
of  ligeuidS;  stepwise  fonhation  constamts,  • 

Simulates  fetation  function  in  metal-ligand  systems. 
Parameters:    number  of  ligands,  stepwise  fonnation 
constants • 

The  .output  of  these  simulation  programs  has  been  recorded 
on  Super-Smm  film,  and  55-miii  slides  >and  prints.    Film  loops  are 
currently  being  prepared, and  the  slides  and  prints  can  be  used  in 
several  courses. 

During  the  faj.1  term  of  1970,  a  memo  >{/as  sent  to  all 
students  enrolled  in  physical  'chemistry  courses.  Interested.^ 
students  were  invited  to,  sign  up  for  special  one  credit  course  in 
programming  the  interactive  graphics  system.    The  course  met  one 
evening  each  week.    Eight  students  ervrolled.    After  two  hours  the 
students  were  able  to  start  programming.    The  only  requirement  was 
a  documented  project  due  at  the  end  of  the  term.    Most  of  the 
students  chose  their  own  projects.    They  included:    a  simxiTation 
of  van  de^  Walls  forces,  a  study  of  the  s.olubility  behavior  of  *  ^ 
silver  halides  as  a  function  of  halide  concentration,  an  emalysis 
of  the  Michaelis-Menten  equation  and  Lineweaver-Burk  plots -of 
enzyme  inhibition  systems,  ^rid  a  simiilation  of^the  Monod-Wyman- 
Changeux  model  of  enzyme  coopera"tivity.    One  of  the  stjiui^nts  was 
so  hooked  on  the  system  that  he  wanted  more  information  about  thg 
NSF  proposal.    He  was  going  to  ^graduate  school  and  wamt.ed  to  get 
a  terminal.  .         •  >  * 

Tutorials  "     "  •  ' 


The  Pitt  Computer  Center  supports  the  CAI-oriented 


Name 


LIGAND 


N 


6^ 


language  CATALYST/PIL.    CATALYST  (Compiitea>  Assisted  Teaching  and 
•  -Laming  System)  is  a  prod^ssor  designed  to  facilitate  the  writing 
'  of  CAt  AessOTs.    PIL^  is  a,  JOSS-derived  conversational  language. 
The  CATALYST/PIL  interface  provided  both  lesson  designer  and 
student ^ easy  access  to  desireable  features,  of  both  langxmges.  The 
student,  fctr  example,  can  use  the  desk  calculator  mode  of  PIL  to 
solve  cjtiemical  arithmetic  problems..    The  lesson  designer  con  build 
in  random  numbers  vherever  desireable  so  that  no  two  students 
receive  the  s.ame  problems.    It  ^is  alsq  easy  for  the  lesson  designer  . 
to  build  in  branches  if  the  student  asM  for  he^p,  more  infohna-  - 
'  tion,  is  off  by  a  factor  of-  tyo,  etc.  •  \ 

IJhe  emphasis  has  been  on  nimierical  proMemB,  «  There  are 
•  ;20  tutbrial-drill  lessons,  on  stoichiometry,  gas  laws,  colligative 
.  ^properties,  Faraday's  laws  and  aqueoti^  equilibria.  *  There  are  ^• 
Sreroal  lejssons  on  bonding  theories,-  periodic  properties  and  elec-' 
trolytes.    .The  lessons  average  20  minutes  of  terminal  time. - 

In  principle  this  appr^^acb  has  a  nmber'of  advantages. 
The  com{)uter  is' an  infinitely  patient  tutor.    Stu&ents  c'an  use  PIL 
,.to  solve  stoichiometrv  and  pH  problems.    At  a^y  point  the  student 
can- type  help  and  get  an  appropriate^  hint.    If  the  student  ty^es  a 
wrong  answer,  the  cotoputer  will  analyze  it  for  the  two  or  three 
most  common  mistakes  and  warite  an  appropriate  message,  ("Did  you 
forget  tQ  convert  from  degrees  Centigrade  to  degrees  Kelvin?").    •  , 
After  two  or  three  incorrect  responses  the  computer  will •outline  , 
the  correct  solution  t6  the  problem. 

There  are 'also  a  number  of  disadvantages.    A  time-sharing 
system  is  required.    Free  time  and  an  adequate  supply  of  terminals 
must  be  available.    The  system  must  be  reliable,  and  easy  to  use. 
\  The  lessons  require  a  tremendous  investment  in  programmer  and       .  ^ 
lesson-designer  time,  .  ^^-^ 

^  '  ♦ 

The  students  seem, to  enjoy  using  the  programs  when  the 
time-sharing  system  ia  stable  and  they  can  find  a  terminal.    They  - 
ask  me  when  programrs  on  the  Bohr  atom,  radioactivity,  hybridiza- 
tion, etc.  will  be  avajllablfe.    I  think  that  with  care  an  effective       .  ^ 
libf^ary      tutorial-dr.ill  modules  can  be  deyeloped  that  will  make 
freshman  chemistry  a  ^rvore  stimulating  learning  experience'.  * 

Programming  Langimges  )^  .  " 


^Anrattempt  is  beinf^'^ma^e  to  mak^  computing  an  integral 
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Analytical  Chemistry  (Continued) 
Name  DescriptiQn 

TITR  Data  reduction  program,,  determipes  thermodynamic  acid 

dissociation  constants  of  g-alanine  frOm  experimental 
data, 

•TiTRN#         Data  reduction  program,  numerical  differentiation  of 
titration  data. 


GAS 


EQUIL* 


HEAT 


BOX 


VOLUME 
ENTROPY* 


NMR* 


Ph/sical  Chemistry 

Calculate  compressibility  of  a  van  der  Waals  gas.  as 
a  function  of  P,  a,  b,  and  T.  ^ 

Calculate  equilibrium  constant  by  minimizing  total- 
free  energy  function,  functiolif.of  stoichiometric 
factors,'  chemical  potentials  and  temperature. 

Calculate  enthalpy  of  reaction  from 
Solve  particle  in  the  box  problem: 


2a(2mE) 


1/2 


.tan 

fvinctipn  of  a^  and  Vo. 


■2(Vo-E)^/V/" 
2E-V0 


Perform. rQgr^ssiorf  analysis  on  EMF  data .as  a  ftoction 
of  molality. 

Determine  partial  molal  volume  from  solution  data. 

Calculates  the  entropy  oh  a  molecule  from  sp^ctrosco- 
'pic  data;  fwiction  of  the  number  of  atoms  in  the 
molecule,  linearity,  temperature,  pressure,  symmetry 
factor,  multiplicity,  fundamental  vibration  frequen- 
cies, mass  of  each  atom, and  the  coordinates   of  each 
atom  relative  to  an  arbitrary  origin. 

Calculates  an  AB,  ABg,  ABX,  AgXg  or  AgB^  NMR  spectrum 
as  a,  function  of  chemical. shifts  and  coupling ^ 
constants. 
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part  of  undergraduate  instruction  in  chemistry  at  Pitt.  The 
freshman  use  PIL  in  a  desk  calculator  mode  as  described  above. 
In  analytical  chemistry  an  optional  problem  set  is  available  in 
which  certaih  chiemical  systems  are  to  be  simulated.    A  library  of 
Fortran  programs  is  also  described  so -that  students  can' check 
their  work  and  use  the  computer  to /explore  these  systems,  as  a 
function  of  the  paramet^ers*.  ^  Another  problem  .sel;  is  available  to  . 
physical  chemistry  students.    In  addition,  two  ho\irs  of^instruc- 
tion  in  PIL  are;  included  as  part  of  the  laboratory-^^ecitatiOn 
program.    This  is  sufficient  background  for  the  students  to  write 
their  own  linear  l^ast  squares  pi^ogram. 

An  upper-division  elective^  '^Numerical  Methods  in  Chem- 
istry", is  given  once,  a  year.    In  this  covrse  the  PIL  and  Fortran 
IV  languages  are  covered  and  several  topics  in  numerical  analysis 
are  discussed  from  a  chemical  applications  point  of  view.  This 
course  is  described  in  J.  Chem.  Ed.  Vj_,  819  (197O). 


Table  II. 


The  current  library  of  Fortran  programs  is  described  irl  ' 


Name 


HA* 


EDTA* 


nternst* 


Table  II.    Fortran  Programs  ^ 

Analytical  Chemistry    '      ,  . 

Description 
« 

Simulate  weak  acid-strpng  base  titration  curve; 
function-  of  Ka,  Ca.  / 

Simulate  EDTA-M^***  titration  curve j  functiok  of  Kf  and 
pH. 


Calculate  EMF  for  MjM^^  electrode  in  M^^***  -NH^  solu-. 


tion;  function  of  pH,  Ki,  Z,  E 


MX2  ^  Calculate  solubility  of  MXp  where  X  is  conjugate  base 

of  HX;  function  of  Ksp,  Kdf  pH. 

MX*  Calculate  sop.ubility  of  MX  where  X  is  the  conjugate 

.  base  of  H^X;  function  of  Ksp,  K^,  K^,  ...  K^,  pH. 

MA#  Calculate  solubility  of  MA  where  M  forms  canpl^xes 

with  a  ligand  L;  function  of  Ksp,      ,  K  ,  ...  Kn, 
.  log  [Ll.  ^         '     '  /  •      ^  „ 
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Name 
ABC* 

HMO* 
' CONTOUR 
LINEAR* 

GUG  ' 


Physical  Chemistry^  (Continued) 
Description 

Calculates  an  ABC  NMR  spfectrum  as  a  fvinction  of  chemi- 
cal shifts  and  coupling  constants. 


Solves  Hiickel  Molecular  Orbital  problems. 
Calculates  orbital  contours;  function  of  Z 


eff 


and  Y*Y. 


Data  reduction  program,  perfoms  'linee^r  regression 

analysis.  '  .  • 

Data  reduction  program,  LINEAR  modified  for  the 
Guggenheim  method. 


*  Others 

IH*  ^  Searches  a  database  of  organic  compovinds  containing  ^ 

■    melting  and  boiling  points,    strong  IR  band  frequencie 
and  Sadtler  file  -members. 

*        .  f> 
'  POLREG*        Polynomial  regression  analysis. 

MATINV*        Matrix  inverter.  '  \ 

JACOBL  '       Hermetian  matrix  diagonalization  routine.' 

''RUNGE  Solves  a  sjrstem  of  differential  equations. 

.SECANT  Solves  a  system  of  nonlinear  simultaneous  .equations 

(Secant  method). 

NEVrrpN#        Solves  a  system  of  nonlinear  simult^ineous  equations 

j^Nevton-Ra^heson  method).  '  .  - 

,  1TINV#  , Interactive  matrix  inverse  routine. 

NONLJN  #  Nonlinear  least  squares  routine.  ^ 

PLOT*.   '  Typewriter  art  plot  routine. 

CALCrtP  #  CjalciShp  plotter  routine. 


Complete  documentation  available. 


In  preparation. 


"^T  ^ 
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Conclnding  Remarks 

4 


I  have  emphasized  here  two  applications  of  computers  in 
chemical  education.    The  one, ,  thiuDugh  CAI,  is  a -tutorial-drill 
facility.    And  the  second  iS  a  facility  for  simulation  and  problem 
solving.  -  CAI  may  be  able  to  provide  poorly  motivated  freshman 
chemistry  students  an  interesting  and  effective  teaching  tool. 
The  simulation  programs  will  allow  good  students  to  manipulate 
chemical  systems  as  a  function  of  the  paraiiieters.  '  These  prograins 
can  provide  a  releftively  painless  procedure  for  studying  numeri- 
cally complex  systems.    Calref ully *chosen  computer-based  curricu- 
lum materials  can  acjd  a  new  dimension  to  chemical  education. 
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APPLICATION  OP- "CANNED"  PROGRAMS  AND  COMPUTERIZED  LITBfATURE^  SEARCHiHC 
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;  By 
Larry-  R.  Sheman 

.  D^partaent  of  Gjiowis-^T        "*  ' 
Worth  Carolina  Agricultural ,  and  Technical  State  iJniVfZ»lty 
Greensboro,  North  Carblina   27ifU.  .  '■• 

INTRODUCTION 

The  use  of  coaputer  technology  In  the  cheiicai  curr^-culua  at 
A  4  T  SUte  University  can  be  divided  into  three  parts,    (l)  The- 
conYentional  use  of  jKrograMming  'in  the  Instruaehtal  analysis  course 
to  Hid  in  calculating  the  data  derived  fro»  complicated  experiaentsi 
(2)  The  use  of  .canned  programs  in  the  nprmal  cheaical  currionluMi 
\ (3)  The  development  of  a  course  in  computer  Uterature. searching 
and  the  retrieving  of  data  from  a  data  bank.  . 

,'  ■» 

There'  are  tKo  computer  facilities  «:vallable  ta  faculty  and 
students  at  A  4  T  State.    There  is  a  Computer  Science  Depaytf^t 
on  the  campus  wjiich  operates  a  Control  Da-t;a  330(\  compute?  (CDC), 
T^  computer  will  accept  Fortran  IV,  Gobol,  and  extended  Algol-  . 
programs  and  is  adequate  for  teaching  computer  programmingj 
administrative  ^iparation,  and  processing  research  dataj  however,  . 
all  data  is  ba^^dh  processed  and  the  computer  facility  lacks  the. 
versatility  d^Ssired  for  classroom,  instruction.    It  has  the 
(advantage  of^ twenty  minute 'to  one  hour  turnover  time  during "the" 
seventy-seven'  hours"  it  operates  .per  week.    The  second  facilitjr 
avaStlable  is  an  I3H  IO50.  tetioinal  connected  to  th?  Triangle 
University  Computer  Ceftter  (TUCC)  at  the  Research  Tirlangle  Park. 
The  terminal  is  located  in  the  Engineering  Building,    TUCC  has 
an  IBM  360  Model  75  computer  which  is  exclusively  devoted'  to 
time  sharing  J  this  computer  will  accept  many  more  languages  than 
the  CDC,  including  ?L/1,  Wat]^our,,  WatFive,  ai)d  conversational,  • 
program  (cps).    Since  almost  one  hundred  termliials  throughout 
the  state  of  North  Carolina  feed  data  in  the  computer,  the 
turnover  time  is  often  five  to  seven  hours 'during  busy  times  in 
the  Uniyerslties  schedules.    However,  the  disadvantage  of  slow 
turnover  time  is  compensated  by  the  vast  data  bank  which  is 
available!  and  since  a  large  number  of  programs  are  available  in , 
conversational  programming,  direct  classroom  instruction  can  be 
given  through  the  terminal.    Due  to  the  differences  in  the 
systems,  direct  translation  from  one  system  to  the  other  is  not 
possible  and  minor  changes  in\  programming  are  necessary,  beside 
changes  in  Job  catd  and  systeh  cards,'  even  when  the  same  language 
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l8  being  iised..  Thus  elementary  student  prograafl  becoae  United 
to  one  system. 

PART  I  '   ■  ■ 

Because  of  the  lack  of  transportability, all  prograaaing  * 
instruction  in  the  cheaistry  department  was  adapted  to  the  CDC, 
Fortran  IV  programming  is  introduced  into  ths  Instrtmental 
Analysis  class.    The  students  receive  four  lectures  on  pro- 
gramming, giving  them  sufficient  background  to  use  the  computer 
to  perform  linear  regression  analysis  and  giv*  statistical 
interpretation  of  their  experimental  data;   Although  the 
students  do  not  leatn  any  sophisticated  programming,  they  hare 
learned  the  fundamentals  and  application  of  data  processing  to 
non- trivial  experiments. 

PART  n 

Other  than  the  instrumental  analysis  course,  the  Chemistry 
^  Department  does  not  offer  its  stuients  formal  programming 

instructions!  the  students  are  expected  to  obtain  the  instruction 
in  other  departments.    In  other  chemistry  courses,  the  computer 
is  used  as  a  "black  box. "   It  is  often  difficult  to  convince 
people  of  the  merits  of  computer  black  box  instruction.  Some 
sophisticated  programmers  feel  the  black  box  usage  has  little 
educational  value,    Some  novices  feel  the  computer  is  a  gimmick. 
Although  botji  arguments  have  merit,  the  people  who  put  forth 
these  arguments  fall  to  look  at  the  purposes  behind  the  computer 
.  usag^.  -  -They  see  the  computer  as  a  tool  in  itself  and  not  as  a 
means,  to  teach  other  subjects.  ••I^  computer  should  be  viewed  in 
the  same  manner  as  the  infrared  (IR)  or  nuclear  magnetic  spectro- 
aeter  resource  (NMR).    Every  chsaist  is  expected  to  know  the  ^ 
rudiments  of  interpreting  inftared  spectra,  but  larely  ualerstand 
the  internal  workings  of  the  instrument  or  the  quantum  meolmnical 
reasons  for  ;the  infrared  bands.'  likewise  most  chemists  can  give 
an  intelligent  discussion  of  magnetic  shifts  of  an  NMR  but  few 
understand,  the  difference  between  homogeneous  and  nonhomogeneous 
•fields  and  fewer  chemists  can  solve  the  exponential  Hsalltonians 
which  ^(xplAin  the  observed  shifts.    like  the  IR  and  the  NMR,  a 
computer  cwi  be  used  to  teach  chemical  principles  without  an 
understapdlhg  of  the  theory  behind  its  use.    Just  as  full  vap  of 
an  IR,or  -NMR  demands  a  spectroacopist;  the  full  use  of  iTcomputer 
(  demands -a  skilled  prggrsmmer.    However,  other  users  only  need  to 
know  what  kind  of  data  they  desire,  and  which  buttonk  to  push. 

'  '  ;     .  ,  QUIZ 

Some  ■school&>,^e  the  Uhiveirsity  .of  Pittsburgh,  have  developed 
computerized  instructional  homework  but  ;the  faculties  and  time 

>  )         •    "  : 
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noctssary  for  thie  type  of  jrogx^amad  learning  are  generally  pro- 
hibitive.   The  program  QUIZ   is  a  coaputeriaed  set  ot  typical 
cheaistry  woW^eae,    It  contains  general  cbealetry  and  quantitative 
analysis  prbbleaa  that  are  worked  Iqr  the  atudent  as  hoaework.  The 
student  worlds  the  problem  like  any  hoaework  assignment,  then  he 
puts  his  answeira  on  a  punch  card  alonij  with  his  name  and  student 
numbitt  the  punch  card  is  delivered  to  the  instructor  who  places 
It  1^  the  pr6gram  deck«    The  computer  checks  the  student's  answers 
♦and  if  "an  ancrtcer  is  cornet  it  identifies  the  question  and  prints 
a  congratulatofy  sentence »    tf  the  ans^r  is  incorrect,  the  computer 
prints  a  warning  that  it  will  only  accept  answers  with  three  signi- 
ficant figures,  then  prints  the  correct  format  for  solving  the 
problem  and  asks  the  student  to  rework  the  question.    The  computer 
also  keeps  a  running  tally  of  the  number  of  correct  answers  sub- 
mitted by  eacli  student  .and  prints  a  grade  on  each  set  of  questions, 
Althou^  this  grade  coxild  be  used  for  computing  students*  girades. 
It. is  not  recorded  because  a  keypunch  error  can  not  be  distin- 
guished frqm  an  error  in  chemical  theory  nor  is  it  possible  to 
distinguish  resubmitted  problems  from  those  assigned  for  that 
week. 

With  QUIZ  the  student's  work  is  corrected  the  same  day  it  is 
submitted  and  herrecelved  tutorial  help  with  any  problem  which 
causes  him  difficulty.    QUIZ  also  allows  the  student  to  resubmit 
incorrectly' worked  problems  and  have  them  graded!  this  would  not 
normally  be  practical  if  questions  were  gradckl  by  the  Instructor.  ^ 
Complicated  redox  equations  are  Included i  the  student  enters  the 
sum  of  the  coefficients  for  reactants  and  products.    Since  only 
redox  equations  are  used,  the  probability  of  the  sum  being  correct 
when  the  equation  Is  balanced  incorrectly  is  mlnimited. 

QUIZ  is  written  in  Fortran  IV  and  can  hold  999  typical 
chemical  problems,    the  student  is  given  a  mimeographed  sheet 
containing  the  problems,    feone  of  the  problems  are  complete, 
'other  problems  have  data  missing i  the  missing  data  is  supplied  by 
the  student,  using  the  least  three  digits  of  his  student  number. 
The  use  of  blanks  gives  some  individuality  to  each  student's  work 
and  decreases  the  copying  of  answers.    The  student  must  use  his 
own  student  number  on  the  answer  cardi  the.  computer  picks  u^  the 
Ifiist  thr^e  digits  of  the  student  number  aM  inserts  il  into  the 
problems  which  is  stored  in  the  program.    The  final  printout  is 
returned  to  the  student  by  his  nuJfcer. 

Because  of  the  additional  time  needed  by  the  student  to 
prepare  a  punch  card,  the  response  to  computerized  homework  has 
been  less  than  with  conventional  method^. 
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HA 

"    3  ■  •  ^ 

HA  ,  a  program  for.  preparing  an  acld-tas©  titration  curve,  io  one 

of  the  most  useful  programs  in  teaching  analytical  cheaistry.  The 

program  was  first  introduced  into  the  Quantitative  Ahalysia  course. 

The  student  enters  the  normality  6f  the  acid,  normality  of  the  ~ 

haae,  number  of  ml  of  acid,  and  pKa  for  the  acid  onto  a  c<Mrputer 

card,  and  the  computer  retittns  twenty-four  points  for  pH  va  ?6  "* 

^titrated  curve  from      to  120^«of  "base.    The  student  then  plots 

the. data  on  a  piece  of  graph  paper  to  see  the  effect  of  a  Change 

in  pKa  on  a  titration  curve.    Since  the  computer  time  is  less  than 

two  seconds  per  run,  it  is  possible  to  have  each  student  prepare 

a  titration  curve  for  five  pKa  values  from  one  to  nine.  Normally, 

a  student  could  not  be  expected  to  plot  a  curve  foJr  more  then  one 

pKa  value  in  an  assignment  because  of  the  tedious  caloulatl^'ns. 

Currently  the  program  solves  &  q^uadratlc  equation  but  It  ^ 
Is  being  extended  to  the  solution  of  cubio  equations  so'  that  |*a 
values  of  greater  than  nine  can  be  used  Kltbout  changing  the 
basic  format  t    This  program  Is  mounted  both  at  TUCG  and  on  the 
local  CDC  system,  so  that  only  a  minimum  of  student  time  Is 
required  In  obtaining  data, 

POLTRA  - 


Anjjther  program  which  has  considerable  use  Is  the  POI/PSA 
program  .    It  Is  a  program  for  solving  polynomlnal  equations  imd 
has  a  tteat  deal  of  use  In -'helping  to  avoid  the  tedious  calcula- 
tlon^rlnvol^ed  In  computing  the  pH  of  a  weak  acid  or  base  whan 
the  weak  electrolyte's  conpentratlon  Is  such  that  a.  significant  ~ 
portion  of  the  electrolyte  has  dissociated  (10^  or  greater),  Ei 
a  complex  pH  probjtera,  five  to  twelve  non-linear  equations  are 
obtained  with  the  same  number  of  unknowns  as  equations.    These  can 
be  reduced  to  one  polynomial  equation  which  is  then  solved  by 
numerical  analysis  j  the  latter  may  involve  hours  of  hand  calcu- 
lations especially  if  the  init:^  guess  is  poor.    In  the  POLTRA 
program,  the  student  enters  the  order  of  the  polynomial,  and  ^ 
the^  coefficients  in  asceMlng  order.  Mth  these  the  computer 
calculates  all  real  and  Imaginary  solutions  to  the  problem.  The^ 
student  can  then  use  the  positive  real  numbers  to  determine  the 
exact, pH  of  the  solution.    This  program  has  ma4e  it  possible  to 
assign  a  large  number  of  problems  which  il^^ust^ate  the  cinnge"' 
in  pH  with  a  change  in  concentration, ^nKa,  or  pKb  without 
requiring  the  student  to  perform  the  iengthy  arlthematic.    It  is 
also  possible  to  illustrate  the  leveling  affect  of  water  on  the 
pH  of  the  solution,    often  the  latter  is  neglected  because  the 
solution  of  the  equations  is  "too  difficult,"  and  the  instructor 


allom  the  students  to  make  Invallcl  aesunptlons  b^ci^uee  the 
aniswero  tiui  not  easily  1»  calctilated. 

'  POLTRA  is  a  good  example  of  xising  the  computer  to  perform 
calc^ulations  so  that  detailed  chesiical  prii^iples  can  he  stX2dled, 

^    '   \      ^    ^  ISIS  ' 

ISIS^  ^is  a  pregxam  for  searching  the  American  Society  for 
Testing  Materials  (ASTM)  infrared  fUe  mounted  at  TUCG.  ASTM 
has  gathered  all  the  major  IH  files  (Sadler,  AIPE,  AHC,  etc«) 
and  has  computerized  the  searching  of  92ff000,  XH  spectra.  This 
ptrogram  has  been  moimted  at  T^CC  and  can  be  reached  through 
remote  terminals.    The  program  Is  most  efficiently  run  if  the 
norband  region  of  spectrum  is  fiirst  determined}  a  no«band  is  a 
portion  of  the  spectrum  where  no  vibrations  produce  a  band  of 
more  ttmn  30^  transmission,  ^  Next  the  vave  length  of  the  IB 
bands  are  determined  and  the  computer  searches  the  file  to 
identify  the  spectrum.    To  compensate  for  differences  in  Instru* 
mentSt  a  "wiggle"  factor  of  0,1  mlcroim  is  provided  for  each 
peak  wave  length.    The  computer  prints  out  the  code  numbers  of 
all  the  Compounds  with  spectrum^  which  corresponds  to  the 
entered  data.    Usually  three  tp  tei^  code  nxmbers  are  obtained,  f 
but  %lnce  more  than  one  file  may  contain  the  same  compound,  the 
"hit"  ratio  is  mpch  higher  than  would  Initially  appear  from  the 
printout,  ^  ' 

In^the  Instrumental  analysis  course,  the  students  obtain 
the  infjnlred  spectrum^of  a  solid  and'  liquid  -unknown  (usually  a 
'  simple  organic  compound  like  DMF,  DMSO,    -napthanol;    They  first 
try  to  identify  the  compound  xising  reference  books, After  a  week's 
work  -  iisually  unsuccessful  *-  they  are  Instructed  in^  the  use  of 
ISIS  and  receive'' the  answer*  i;o  their  unimdwn  in  ^abbut  twenty 
minutes, 

ISIS  has  also  been  used  by  the  students  taking  QUsdltative 
Otganlc  Analysis,    Although  ISIS  was  not  introduced  Into  the 
cotitrse,  the  stiftlents  ^xiickljr  learned  to  use  it,  an^  began  Id^nti-* 
gylng  all  their  unknown  by  IB,    Unfortunately,  the,  instructor  who 
.  taught  the  covtrse  did  not  recognize  the  potential  of  computer 
searching,  and-  fQlt  the  students  had  just  learned  to  '*beat"  the 
course, 

GAME         *  ;  ^  ; ^ 

/ 

GAME  is  a  qualitative  orgauiic  unknown  game.    It  is  preprogramed 
with  organic  compounds  and  forty  qualitative  analysis  tests  for 
jJKfdfBe  compounds «    Each  test  is  assigned  a  cost  which  is  based  on 
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the  tine  it  Would  take  an  analyst  to  perfom  the  test.    The  object 
of  ' the  galie  is  to  determine  the  unknown  at  the  loweet  cost.  Since 
an  understanding  of  the  tests  is  essential  to  identify  the  com- 
pound,, a  ^ceat  deal  of  organic  chemistry  can  !»  learned  Kith  GAME. 

.^Fkculty  members  usually  enjoy  playing  the  famei  however, 
students  sometimes  become  frustrated  because  they  may  run.  fifteen 
^  to  twenty  tests  axid  still  have  no  answer  to  their  unknown.  GAME 
*  hM  limited  usefulness  in  the  North  rhi—c^^^nft  B^Lucational  Computing 
Service  (NCECE)  system,  for  the  students  quickly  learn  thsy  can 
obtain  the  IR  for  their  unknown.    They  punch  the  data  on  a 
computer  card  and  .run  the  ISIS  program  to  identify  the  'unknown.  • 
When  this  happens,  GAME  becomes  useiilss  and  it  is  put  awmy  until 
the  next  class. 

• 

If  one  of  the  objects  of  a  modem  chemical  curriculum  is 
to  teach  students  to  interact  parameters,  GAME  and  ISIS  have  done 
this.    Computer  technology  does  not  only  save  time  and  open  new 
avenues  of  study,  it  also  frustrates  instructors  whose  classiciJ. 
teaching  methods  become  obsolete^  .It,  more  than  anything  else',  - 
demands  that  new  approaches  and  ideas  be  introduced  into  chemical  • 
curriculum  so  that  the  normal  curriculum  does  not  become  trivial.  ' 

PART  ni 

At  A  (t  T  there  is  a  catchall  course  entitled,  "Current  Trends 
in. Chemistry."    In  the  summer  of  1^0  it  was.  decided  to  use  this 
course  t# teach  computerized  literature  searching  and  data 
retrieval. 

The  course  is  a\^phomore  chkmistry  course,,  all  students 
having  completed  at  least  general  chemistry,  quantitative  analysis,  ' 
and  organic  chemistry.    The  majority  of  the  students  have  not  tmui 
a  course  in  computer  programming  and  this  course  is  taught  with 
the  .same  black-b<jx  concept  described  earlier.    The  stxxlents  are 
given  a  very  elementary  introduction  to  computer  usage  and 
encoi^raged  to  ask  questions  about  programing,  but  no  programming 
per  se  is  taught,   .The  studepts  are  instructed  in  keypunch  opera- 
tion for  batch  processing  of  data  and  the  us^  of  a  teletype  for  _ 
CPS  work;  the  latter  is  used  whenever  possible.    About  one  third 
of  the  course  is  devoted  to  literature  search  using  an  uncomven« 
tional  method,    One  third  is  devoted  to  data  retrieval  euod  processing 
and  the  remainder   to  studies  of  employment  and  problems,  in  the 
chemical  industry.    Searches  are  tfot  conducted  to  be  complete  W 
to  be  rapid.    This  means  a  great  deal  of  literature  Aight  be  over- 
looked, but  in  light  that  the  majority  of  our  students  obtain 
industrial  positiolts,  where  useable  data  on  a  subject,  is  more 


iaportant  than  conplete  retrieval,  speed  is  emphasized  over  total 
rscall.    At  first  it  waft  thought  that  the  Computer  Readable  Service 
of  Chemical  Abstracts  (CA)  would  be  ideal  since  it  is  the  largest 
data  bank  available*    Burlington  Industries,  the  only  Ipcal  sub- 
scriber to  this  serd.ce,  offered  their  facilities.    Honever,  they 
discontinued  subscription  to  the  CA  tapes  before  the  class  met, 
Contact  was  then  made  with  Mr.  HacDonald,  the  Marketing  Manager 
at  Chemical  Abstracts,  who  offered  to  seU  a  sample  package  of 
tapes,  programs,  etc.  for  The  package  could  be  mounted  on 

any  computer  aystem  and  be  repeatedly  used,  but  because  of  a  change 
over  at  CA  to  a  standard  distribution  format,  the  package  wou34 
have  been  obsolete  before  it  could  be  used  a. second  time  in  the 
course. 

Since  the  CA  file  is  the  largest  and  most  complete  scientific 
data  abstracting  and  editing  service,  it  could  not  be  neglected. 
The  stuients  were  introduced  to  the  purpose  and  historical  back- 
ground of  GA's  computer  readable  service.    They  were  given  the ' 
cost  of  buying  the  service  and  the  alternate  of  suj56c:ribing 
through  a  data  processing  center.    Typical  "file**  ca^rds  were 
distributed  to  the  stirients  and  an  explanation  given  cxf 'all  the 
Information  on  the  cards,  ftith  emphasis  put  on  the  indexing  search 
words.' 

An  alternate  to  the  CA  search  wm  provided  by  the  Science 
and  Technology  Research  Center  (STRC)  at  the  Research  Triangle 
Park.    Th^y  conduct  searches  on  the  ilASA,  DOD,  TI  and  EI  files  ^ 
on  a  contract  basis  for  universities  and  industries.    They  have 
also  prepared  a  demonstration  file  on  th^  Textile  Industry 
(TSXZy.    This  file  is  stored  at  TUCC  and  assessable  through' the 
rfemote  terminals.    Permission  Was  given  by  STRG  .for  students  and 
instructor  of  this  course  to  use  the  file.    The  searches  are 
limited  because -of  the  amount  of  data  in  tlie  film,  but  illustrate  ■ 
many  important  points. 

'    The  TEX2  file  is  designed  to  search  two  subjects  at  once. 
While^the  file  is  being  searched  for  a  subjeotc  another  subject 
can  be  added  to  the  search  and.  all  references  to  both  subjects 
can  be  recalled  from  the  system i  or  while  a  subject  Xs  being 
searched  it  can  be  intersected  by  another  subject  and  all 
references  common  to  both  subjects  can  be  irecalled  from  the  system. 
Information  which  is  unrelated  to  the  subjects  desired  can  be  pro-- 
grzuimed  to  be  deleted.    For  example,'  a  search  for  references  on 
formaldehyde  resins  would  produce  references  for  both  phenolic 
and  acetone  resins.    If  only  references  for  phenolic  resins  are 
desired,  jreferences  to  acetone  resins  can  be  deleted  'by  instructing 
the  computer  to  delete  them  from  the  list.    However,  of  the  three 
search  techniques,  addition,  intersection  and  deletion,  the  latter 
is  the  least  useful,  because  it  is  often  better  to,  retrieve  ex- 
^  Jtraneous  references  and  manually  eliminate  them  than  to  allow  the 
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computer  to  eliminate  references  whiQh  might  be  useful.  However, 
because  of  the  feize  of  the  TEX2  film  it  has  not  been  programmed  to  ^ 
delete,  ,     '  *  '  ' 

r 

An  index  of  key  ^ords  and  the  number  of  times  they  are  used 
has  been  prepared,  to  show  the  total  number  of  paper  references 
under  any.  subject  matter.    This  gives  the  searcher  an  indj.cation 
of  the  fruitfulntss  of  his  search.    If  a  9ub;Ject  haw  only  three 
or  four  references,  an  intetsection  o;r  deletion  could  eliminate 
all  the  information  on  the  subject,  giving  no  base  for  the  con* 
tinuatidn  of  a  search.    Large  subject  items  (500  to  20,0Q0)  woxzld 
produce  so  many  estrajieous  references  without  intersection  or 
deletion  that  a  computer  search  would  have  no|  advantage  over  a 
manual  search.  ^ 

^  <■ 

The  TeX2  file  which  is  written  in  CPS  has  a  limited  nuMbet  of 
references  and  keyword  and  is  programmed  to  print  out  the  nearest 
word  to  a  misspel^-ed  or  nonexistent  keywords.    When  a  keyword  is 
requested,  the  computer  prints  out  the  dumber  of  refej^nces  and 
asks  how  many  ar^  desired.    Tliis  file,  which  was  prepaW  by 
textile  students,  also  demonstrates  the  probletas  which  ci«C.ari.5o' 
when  technicians  in  one  field  prepare  an  index  to  be  used  in*821 
fields.    There  are  sixteen  references  filed  under  the  keyword  ^ 
Heat  and  two' references  under  Entropy,  a  tern  usually  associated^ 
with  hdat.    If  references  for  Heat  were  requested  and  intersected 
with  a  request   for  entropy  references,  two  printouts  would  be 
expected.    Hbwever,  the  recall  was  zero.  •  The  textile  students^ 
who  prepared  the  file  had  studl^' little^ or  no  physical  chemistry 
and  did  not  recognize  that  Entropy  -should  be  a  s.ubdivision  of 
Heat.    The  failure  of  this  "^search  illustrates  that  even  computer  . 
searches  are  strongly  dependent  upon  i;he  technical  Hnowledge  of ' 
the  programmer  and  indexer. 

]  '  ^  The  TEX2  file  ,wa^  "demonstrated  In  one  laboratory-lectxu:^ 
session.    The  stxKlents  were  asked  to  prepare  a  literature  search 
based  *on  the  file  anc(  execute  it  Ihrough  the  terminal.    They  were 
given  an  outline  and  an  index  and  asked  to  make  additions  to  the 
main  subject,  intersections  and  deletions  (the  latter  ^o  familiarise^ 
the  students  with  the  technique).    From  their  work,  they  prepared 
a  report  on  the  use  of  the  file.    While  the  students  were  preparing 
a  literature  search  on  the  TEX2  file,  a  real  literature  search  on 
electrochetti.3try  was  also  prepared  by  the  instructor  using  the 
NASA  file  as  STRC.    After  the  searches  were  completed,"  the  entire 
class  made  -a  viBit  to  STRC  wh^jre  they  were  instructed  further  in 
literature  searches  and  given  demonstration  of  the  working  of  & 
search  institution,    This  gave  the  students  the  insight  into  the 
vast  amount  of  work  and  expense  put  into  setting  up  and  operating 
an  organization  which  can  handle  the  literatHre  published  in  the 
scientific  fields.    At  the  time  of  the  visit  Mr..  Lockwood,  the 
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Applications  En«;ixiMr,  illustrated  ligr  iisias  tbt  Instructor's 
proparsd  ssarch,  ths  tsdious  proosss  of  dslstinffy  'afttr  rstrioralt 
tho  oxtzmQSous  iBformation  vhioh  naturally  coms  irltb  ths  sMZch 
dttt  io  non^spscifioity  of  tfas  finj^lish  lansQM(^«    Ho  iilustratsd 
this  \tj  oxplaiaing  that  during  a  ssareh  for  rsfsrsnots  on  Vsnstlan 
blinds,  ths  coalputsr  vill  also*i^tritTS  rsfsrsnoss  on  MUnd 
Vsnstisns  and  any  attsapt  to  dslsts  th^  lattsr  runs  ths  risk  of 
invalidating  ths  ssaroh. 
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A  computEr-baseu  simulation  gahe  \ 
;  for'  teaching  experi?iental  strategies,  in  chemical'  rznetics" 

^     •  H.  Warren  Smith 

^  ^  Earlham  Cqlle^e 

Rtclimond ,  Indiana 

'-■  f 

Laboratory  exercises  have  trad.itio,nally  'been  a'U  important 
part  of  science  education ;and  especially  chemical  education. 
They  have  served  to  get  our  students  Jiast  the  world  of  words, 
worcis  in  books  or  word3  in  lectures,  to  Che  real  world  of  mate- 
rials.   In  the  laboratory  the  student  observes  how  chemicals' 
-behave  and  begJLns  to  see  that  thfe  chemical  concepts  presented  in 
lecture  are  useful  in  ^understanding  chemical  plienonrenon.    In  the 
laboratory  .the- student  is  introduced  to  a  variety  of* chemical 
techniques 'and  gains  confidence  in  his  ability  to  manipulate  cer- 
tain aspects  of  nature. 

In  addition  to  introducing  the  student  to  the  ritualistic 
aspects  of  chemistry,  the' laboratory  can  also  be  used  to  d^evelop 
other  skills  used  by < the  research  scientist.    As  the  student 
gains  confidence  arid  ability  in  the  experimental  techniques,  he 
is  encouraged  to  take  af  larger  role  i-n  planning  the  experiment. 
Eventually  the  labpratdry  may  become  open-ended  to  the  extent 
that  the  student  is  givlen  a  statement  of  the  problem  to  be  in- 
vestigated and  asked  toXcompletely  design  the  procedure,  carry 
out  the  experiment  and  i^erpret  the  data.    It  is  these  two 
additional  factors  —  experimental  design- and  analysis  of  the 
data  —  that  we  believe  should  be  emphasized  in  the  laboratory 
training  of  our  students. 

K         Computer  Roles  in  Chemical  Education' 

The  computer*  has  been  used  in'  several  ways  to  improve 
laboratory  instruction.    Most  of  the  student  use  of  the  com- 
puter is-  essentially  as  an  extension  of  his  slide  rule.  He 
learns  to  use  it  to  avoid  the  laborious  computations  needed  to 
reduce  his  experimental  data  to  results  that  can  be  understood 
in  terms  of  the' concepts  of  chemisj^^ry.    For  example,  the  results 
of  a  liumber  of  titrations  are  averaged  and  the  concentration  of 
the  solution  calculated  by  the  computer.    I  would  expect  that  the 
biggest  use  of  computers  by  ^chemists  has  been  in  this  data  pro- 
cessing mode. 

Computers, are  also  used,  in  both  teaching  and  research,  to* 
simulate  chemical  systems  with  a. computer-based  mathematical 
modei.    Complex  reaction  mdlchanisms  can  be  readily  simulated  by 
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a  series  of  differential  equations,  so  that  the  concentrations 
of  any  of  the  chemical  species  in  a  reacting  system  can  be 
printed  or  graphed  as  a  function  of  time.    These  calculated 
curves  can  be  compared  with  experimental  data  as  a  test  of  the 
validity  of  the  model.    Pedagogically  this  approach  can  be  used 
to  help  students  discover  relevant  parameters  of  a  system  and 
the  functional  relation  of  these  parameters.    For  example,  a 
computer  programed  with  a  model  of  gas  behavior  might  be  used  by 
a  student  to  discover  the  ideal  gas  law  by  ©laying  around  with 
volume,  tejtip^rature  and  mass  to  see  what  gai  pressures  will 
result.    A  more  sophisticated  simulation  might  lead  him*to  dis*- 
cover  something  about  the  behavior  of  real  gases.    Hence,  the 
simulation  c^n  partially  replace  a  laboratory  experience*  There, 
is  also  an  increase  in  educational  efficiency,  since  the  time- 
consuming  process  of  data  collection  can  be  omitted. 

Generating  individualized  data  for  "dry"  laboratory  experi- 
ments is  a 'use 'of  the  computer  closely  related  to  simulation. 
In  this  pedagogical  technique,  an  experiment  is  described  to  the 

students  perhaps  they  are  shown  a  film  of  an  experiment  being 

performed.    They  are  then  supplied  with  their  personal  set  of 
data,  which  might  have  been  obtained  from  this  experiment.  The 
student  struggles  with  the  problem  of  data  reduction  and  inter- 
pretation in  much  the  same  way  as  in  the  traditional  lab.  How- 
ever, in  the  case  of  the  computer-generated  data,  the  student  is 
certain  that,  while  the  data  may  contain  "experimental"  errors, 
the  basic  design  and  execution  of  the  experiment  are  such  that  a 
meaningful  analysis  is  possible.    In  the  traditional  lab  exercise 
the  experimental  design  has  been  wbrked  out  for  |:he  student,  but 
his  failure  to  follow  the  instructions  carefully  may  tesult  in 
uninterpretable  data.    For  example,  the  student  who  discards  the 
wrong  fraction  of  a  separation  may  be  required  to  perform  the 
probably  unrewarding  exercise  of  repeating  the  experiment.  The 
use  of  computer-generated  data  again  produces  time  saving  by 
eliminating  the  data  collection  step. 

Another  use  of  the  computeit,  which  has  not  been  as  widely 
exploited  in  chemistry  as  in  economics  and  business  administra- 
tion, is  the  use  of  games* ^    The  clear  advantage  of  a  game  is 
that  it  must  have  explicit  goals.    The  student  knows  when  he  is 
winning.    The  second  major  aspect  of  a  game  is  that  it  has  a  set 
of  rules.    The  rules  may  be  highly  artificial,  as  in  football 
where  foreward  passes  are  only  allowed  from  behind  the  line  of 
scrimmage.    In  applying  the  concept  of  games  to  chemical^ education, 
probably  the  most  useful  games  would  have  rules  very  similar  to 
those  used  in  simulation;    For  example,  a  chemistry  game  based  on 
gas  behavior  would  deal  with  pressure,  temperature,  volume  and 
mass  in  ways  that  are  designed  to  model  what  might  reasonably 
happen  in  the  laboratory.    However,  the  rules  are  under  the  con- 
trol of  the  instructor  and  may  be  modified  to  achieve*  his 
educational  objectives. 


The  rules  relating  to  how  the  game  is  scored  are  particularly 
useful  in  shaping  student  behavior •    The  size  of  the  ante  required 
to  get  into  the  game  may  be  made  dependent  on  the  amount  of  infor- 
mation sought  from  the  simulation.    The  size  of  the  payoff  could 
depend  in  various  ways  on  the  type  of  results  obtained.    By  adjust- 
ing the  cost-payoff  formulas  a  winning  strategy  could  be  made  to 
*Corr^§pond  to  the  behavior  of  a  trained  chemist. 

KNEXP;    A* Kinetics  Experiment  Game 


To  begin  the  kinetics-experiment  game,  KNEXP,  the  player  makes 
a  nutober  of  choices  which  determine  the  design  of  the  experiment. 
To  study  a  designated  reaction,  he  selects  the  initial  concentra- 
tions of  reactants,  the  temperatur'e  of  the  reactants,  the  product 
or  reactant  which  is  to  be  sampled,  and  the  number  and  fjreqiiency 
of  samples  to  be  made.    The  player  must  also  specify  the  degree 
of  precision  desired  on  each  of  these  variables.    The  computer 
then  prints  a  table  of  the  desired  concentrations,  along  with  a 
plot  of  these  concentrations  vs.  time.    The ^student  is  then  faced 
with  the  task  of  analyzing  a  set  of  data  which  contains  "experi- 
mental" errors  that  depend  on  his  specified  precisions.  The 
analysis  is  further  complicated  by  the  student's  basic  experimental 
design,  which  may  or  may  not, lead  to  interpretable  data.    The  data 
may  show  only  that,  in  the  case  of  a  relatively  slow  reaction, 
essentially  no  changes  have  occurred  in  the  time  interval  examined, 
or  that,  in  the  case  of  a  relatively  fast  reaction,  the  reaction* 
has  gone  to  completion  before* more  than  one  or  two  data  points  can 
be  collected.    Less  obvious  difficulties  can  arise  when  conditions 
for  "flooding",  or  the  method       initial  rates  are  not  met;  or  the 
data  points  cover  only  a  fraction  of  a  half-life. 

Problems  related  to  experimental  design  are  seldom  allowed  to 
occur  in  the  usual  laboratory  exercises.    In  most  cases,,  design  * 
errors  are  program:ed  out  of  the  students'  experience  by  the  labo- 
ratory manual  author  or  the  course  instructor.    This  is  necessary 
because  of  the  large  amount  of  time  and  effort  ttiat  couliS-be" 
wasted  by  collecting  yninterpretabl'e  data.     Since  the  student  does 
not  have,  the  opportunity  to  try  various  approaches  to  a  problem, 
including  some  obvious  blunders,  he  does  not  have  the  opportunity 
to  learn  why  some  experiments  give  useful  data  and  others  do  not. 

In  order  to  convert  this  simulation  of  a  Icinetics  experiment 
into  a  game,  a  charge  is  made  for  each  kinetics  run,  which  is  in- 
versely related  to  the  degree  of  precision  specified  on  each 
variable  and  directly  related  to  tke  number  of  experimental  con- 
centrations that  are  analyzed.    When  a  student  reports  his  result, 
in  this  case  a  rate  law  and  activation  energy,  he  is  awarded  a 
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certain  payoff,  which  hopefully  balances  off  the  charges  accumula- 
ted in  obtaining  the  result.    In  any  event,  he  has  a, score, ' which 
tells  him  whether  he  is  winning  or  losing* 

The  assessment  of  a  charge  can  also  be  used  to  improve  the 
simulation.    In  this  case  the  ante  required  would  be  related  to 
the  time  and  effort  needed  to  actually  perform  the  experiment  in 
the  laboratcnry.    This  is  the  service  the  computer  is  performing 
for  the  player.    Hence,  the*  charge  is  gradtiated  to  indicate  the 
relative  difficulty  of  preparing  and  analyzing  solutions  or  of 
maintaining  a  selected  temperature  interval.    The  charging  for- 
mula makes  it  too  expensive  to  specify  all  variables  to  the  high- 
est attainable  precision.'   The  player  is  foq^ced  to  consider  the 
trade-off  between  the  rising  costs  of  his  experiment  and  the 
probability  of  a  useful  outcome  to  the  experiment.  Students 
overly  impressed  with  the  cost  of  precision  have  produced  data  '  ' 
which  was  sufficiently  random  that  it  fit  equally  well  {6r 
poorly)  a  zero,  first,  and  second  order  Idnetics  plot.    On  the 
other  extreme,  by  specifying  very  high  precision  or  a  large 
number,  of  data  points  a  player. may  incur  a  larger  expense  than 
any  positive  payoff  he  might  receive.  . 

The  Programing  Strategy  for  KNEXP 

An  abbreviated  flow  diagr^  of  the  program,  KNEXP,  is  shown 
in  Figure  1.    The  reaction  specifications  are  read  from  cards 
and  printed  for  the  student  to  check.    If  thejre  has  been  a  card 
punching  error,  the  run  can  be  halted  at  no  charge.  Otherwise., 
the  program  selects  the  stoichiometric  coefficients  and  the 
Arrhenlus  temperature-dependence  parameters  from  a  stored  table. 
In  the  present  program  the  reactibns  and  riite  laws  are  chosen  to 
be  reasonable,  but  do  not  correspond  tc  actual  reactions. 

The    experimental  errors  are  simulated  using  the  IBM- 
supplied  scientific  subroutine  GAUSS,  which  selects  a  random 
number  from  a  normal  distribution  with  a  specified  m^n  and 
-  standard  deviation.    A  random  initial  index  for  GAUSS  is  ob- 
^  tained  by  having  the  student  flip  a  data  switch,  which  terminates 
a  loop.  ^  Tfie  exit  value  of  the  loop  index  is  then  used  by  GAUSS. 
In  this  vay  it  is  assured  that  no  two  runs  will  be  the  same  even 
with  identical  input 'iiata.    Using  this  method,  initial  values  of 
the  concentration  of  all  reactants  are  chosen  using  the  student's 
precision,  specif ication  as  the  standard  deviation.    These  values 
are  not  further  randomized  during  the  run..  The  values  of  tempera 
ture  and  the  time  are  selected  randomly  at  each  time  specified. 
The  initial  .concentration,  time  and  temperature  vaJLues  are  used 
to  compute  concentrations  at  a  later  time.    The- errors  resul.ting 
from  analysis  of  the  sample  are  introduce^  by  a  final  call  to 
GAUSS,  which  uses  the  student's  specification  of  precision  of 
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analysis  as  the  standard 'deviation •    Hence,  the  values  of  tempera- 
ture, time  atxd  sample  concentration  varyi  randomly  about  their 
''true"  values  during  a  k^etic  run,  producing  a  spread  of  data 
points  about  the  best  straight  line  on  the  correct  kJ^netic  plot* 
However,  a  lack  of  accuracy  in  the  choice  of  initif^l  concentra- 
tion will  not  effect  the  spread  of  data  points,  if  they  are  from  ' 
a  single  experimental  run. 

The  computer  program  uses  integrated  rate  laws  to  calculate 
concentrations  of'reactants  and  products  at  the  times  specified 
by  students.    At  present  three  cases  have  been  progr^iedt  first 
order  reaction  in  one  reactant,  zero  order  in  others;  second  order 
in  one  reactant,  zero  order  in  otherls;  and  first  order  in  two 
reactants  and  zero  order  in  other  reactant^.    Rate  constants  for 
these  equations  are  calculated  using  the  Arrhenius  f orm  of  the 
temperature  dependence  equation.    Other  more  complicated  rate 
laws  and  sophisticated  temperature -dependence  equations  could  be 
employed  as  the  student's  knowledge  of  chemical  dynamics  becomes 
more  a4vanced. 

The  prd^ram  identifies  the  student  by  a  cpde  number  and 
calculates  his  ante  for  the  run  by  a  formula  such  as  the  following; 


cost    -    I    ...  Q'?^..,  + 


5([A3^)/[Aj^  6(T) 


f       0.05  0.5  . 


where 


[A]    Is  the  initial  concentration  of  reactant  A 
o 

<S([A]^)'  Is  the  precision  of  [A]^ 

6(T)  is  the  precision  of  the  temperature  (°C) 

[A]  is  the  concentration  of  compound  A  at  time  t. 

6 ([A])  is  the  precision  of  [A] 

6(t)  is  the  precision  of  the  time  (sec) 

N  is  the  number  of  samples  analyzed. 

the  fir^t  summation  Is  over  all  reactants,  while  the  second  is 
over  all  compounds  being  analyzed.    The  constant  factors  in  this 
equation  are  chosen  to  give  relatively  even  weight  to  the  various 
factors.    However,  if  the  instructor  wished,  to  encourage  the  use 
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of  the  piethod  of  initial  rates,  for  example,  the  constants  in  the 
f  irvt  and  second  terms  could  be  lowered  relative  to  the^  constants 
in  the  last  term.    In  this  way  the  cost  of  setting  up  a  number  of 
kinetic  runs  would  be  reduced  relative  to  collecting  a  number  of 
data  points  with  the  same  initial  concentrations.    Even  without 
this  added  i^ncentive,  students  soon  see  that  four  or  five  data 
pointd  per  kinetic  run  are  usually  aufficlent,  provided  the  points 
cover  an  adequate  r^mge.of  reaction.-  a  fact  often  missed  by  stu- 
dents working  in  the  laboratory,,  perhaps  because  of  the  separation 
of  data  collection  in  the  lab  from  the  later  data  analysis  at  home. 
Also  in  the  computer  game  the  cost  of -^additional  data  points  is 
more  explicit.- 

In  addition  to  .tabular  and  graphical  output  of  the  4ata 
resulting  from  a  run,  the  data  may  also  be  punched  on  to  cards 
for  processing  by  other  programs.    In  conjunction  with  KNEXF,  a 
general  graphing  program  is  made  available  to  the  stxidents  to 
aid  their  analysis  of  the  data.    The  graphing  program  allows  the 
data  from  the  kinetic  experiment  gan^e'  to  be.  plotted  in  the  usual 
zero,  first,  second  or  third  order  kinetics  plots  over  any  range 
of  time-concentration  values.   .The  slope  and  intercept  of  the, 
least  squares  line  through  the  data  points  are  also  calculated- 
One  student,  in  an  attempt  to  get  more  information  from  a  run, 
wrote  a  computer  program  that  systematically  calculated  sets  of 
^ate  constants  for  various  reaction  orders.    This  program  made 
it  possible,  at  least  in  principle,  to  obtain  the  rate  law  from 
one  or  two  carefully  chosen  experiments. 

The  payoff  for  the  correct  rate  liaw  and  reasonable  values 
for  the  specific  rate  constant  at  25°C  and  the  activation  energy 
depends  on  the  difficulty  of  the  reaction  and  the  accuracy  of 
the  results.    Such  factors    as  number  of  reactarits  and  overall 
order  of  the  reaction  are  considered  in  assessing  the  difficulty  . 
of  the  problem.    Values  within  ±  20Z  of  the  "true"  value  of  the' 
specific  rate  constant  and  ±  0.5  Kcal  on  the  activation  energy' 
are  considered  acceptable  fpr  full  credit.     ,  *' 

Conclusion 

The  type  of  laboratory  exercise  represented  by  this  reaction 
kinetics  game  shares  many  of  the  features, of  the  more  open-ended, 
problem  -oriented  laboratory  experiments.    It  engages  the  student^ 
attention  in  more  than  careful  reading  of  the  procedure.  He 
selects  the  procedure  himself  and  works  out  the  details.  The 
analysis  of  the  data  is  hot  a  pro  forma  exercise,  to  reach  a  con- 
clusion already  ^bvious  from  reading  the  text.    On  the  other 
hand,  it  has  the  advantage  over  many  "research"  type  experiments 
that  the  rules  of  the  game  can  be  closely  controlled  by  the  . 
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instructor,  and  the  tlme-consumitig  process  of  data  collection  is 
omitted.    A  common  problem  with  open-ended  labs  is.  that  after 
spending  several  w^ks  designing  the  experiment,  collecting  data, 
and  analyzing  the  results  the  student  realizes  that  his  design 
was  faulty  or  that  other  experiments  must  be  done  in  order  to 
achieve  his  goal.    In  the  computer  game,  he  would  have  ready  ^ 
^access  to  more  data  and  other  experimental  designs.    In  addition, 
the  instructor  can  present  the  problems  in  order  of  increasing 
difficulty,  so  that  a  student  is  challenged,^  but  succeeds  in 
reaching  the  goal. 

In  comparing  the  values  of  the  open-ended  laboratory  with 
the  game-type  of  laboratory,  the  educational  efficiency  of 
these  pedagogical  techniques  should  he  considered.    Many  more 
situations  in  chemical  dynamics  can  be  presented  to  the  student 
using  the  strategy  of  the  computer  game  and  eliminating  the 
educational  unproductive  data  collection  exercise. 

The  rapid  acquisition  of  data  frees  the  student's  time  and 
effort  for  concentration  on  the  important  skills  of  experimental 
design  and  interpretation  of  toperimental  data.  Finally,  there 
is  an  intrinsic  motivational  factor  in  a  game.  Since  the  game" 
is  played  against  the  system  (nature)  rather  than  against  their 
classmates,  it  is  possible  for  everyone  to  win. 

This  is  not  to  say  that  all  wet  laboratory  experiments/ 
should  be  replaced  by  computer-based  simulation  games,  but  that 
some  mix  which  includes  games  is  defind^tely  appropriate. 


Footnote  *  ^ 

\ 

1.    The  inspiration  for  the  program  discussed  in  this 
paper  came  from  DATACALL,  a  simulation  game  programed 
by  RichardsJohnson  to  teach  research  strategies  in 
psychology  at  Earlham  College.    DATAGALL  was  described 
at  the  Dartmouth  Conference  on  Computers  jLn  June  1971. 
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Read  and  Print 
Specifications  of  reaction 

'lA]^  ±  6aA]  ),  [B]  .  ±  6([B]  ) 

0  0  0  0 

T  ±  6(T),  t  ±  6(t),  aN, 
6([A]),  6([B])  - 


'  Get  A  and  E 
from  Table 
for  specified  reaction 


L 


Call  GAUSS 
get  randomized 
[A]-   and  [B] 

0  -0 


/nc: 


Lncrement, 
time  by  t 


Call  GAUSS 
get  randomized 
T  and  time 


Compute 
[A],  [B], 


Call  GADSS 
get  rand oni zed 
[A],  [B] 


Print 

t,  [A],  [B] 


Compute 
cost  of  experiment  } 


Write 
cost  arid  record 
of  run  on  disk 


Figure  1 

Simplified  flow  diagram  of  KNEXP;  a  kinetics  experiment  gaming 
p,-;o8ram. 
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The  Vector  Space  Theory  of  Matter 

by 

p.  A.  Matsen 
Department  of  Chemistry 
Department  of  Physics 
University  of  Texas,  Austin 


ERIC 


Figure  -1.  "Oh.  Six-Step  Way,  lead  us 

from  darkness  into  light,  and  deliver  us  from 
those  loathsome  beasts  who  would  devour  us.'* 
(Apologies  to  Buddha) 
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I..    Introduction*    .      '  , 

*  • 

The  Vector  Space  Theory'  of  Matter  (V'STM)  is  the  title 
of  a  course  which  has  been  taught  to  superior  freshman  'at  the  ' 
University  of  Texas  at  Austin  for  the  past  five  years,  it 
is  open  to  both  science  and  non-science  students  and 
satisfies  the  freshman  chemistry  requirement.    The  motivation 
for  the  course  is  as  follows; 

i.     to  prevent  the  acquisition  of  myths  about  theories  in 
general  and  theories  of  structure  of  matt»!r  in  particular  ( 
•    -  ii:     to  shorten  the  tlm^  required  for  the  student  to  get -to 
the  forefront  of  modern  developments,  .  •  ..  . 

Hi.     to  teach  superior 'non-science  students  about  scientific  ' 
theory,  "  ^  - 

^    Lv.     to  provide  fo^r  all  students  a  philosophy  of  science.     "  " 
V.  ^  t,o  challenge  those  students  who  have  had  a  superior 
^1(3^  s<"tot)i  chemistry  course, 

vi.     ',o  provide  a  realization  of  the  'new-raath'  concepts, 
vli.     u^  provide  cpmput^'r  orientation  in  a  chemistry  environment. 
."TM  is  concerned  with  micromatter:     atoms,  molecules, 
nuclei,  and- elementary  particles.     The  science  syllabus  is 
given  in  Table  1.    Tl#*  starred  topics- constitute  a  short  course  " 
of  about  twenty  lectures. 

vr.TM  e-nploys  vector  space  (Dirac-Heisenberg)  mechanics 
./hicn  emphasizes  vector  spaces  and  operators  rather  than  wave 
fr.chroedinger)  mecf^anics  which  emphasizes  differential  equations.  The" 
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mathematics,  which  is  required  for  VSTM  is  taught  as  an  integral 
part  of  the  course  and  is  based  on  the  concept  of  a  set.  Three 
examples  of  sets  are  exhibited  and  exploited.    They  are:  ^ 

a.  the  complex  number  fiald^ 

b.  vector  spaces -ovejr  the  complex  number  field, 

c.  algebras  of . operators  on  vector  spaces. 

VSTM  does  not  require  calculus.    The  syllabus  alternates  between 
mathematics  and  science,  a  technique  which  makes  for  considey^^^e 
r^inf orceRient  of  the  learning  process.    The  science  provides 
ex^amples  for  the  mathematics  and  the  mathematics  provides  a 
realization  of  thte  science.    ..  ,      '         .  ^     -    ^  ^ 

.  The  good,  student  has  little  difficulty  with  either 


/ 


the  mathematics  or  its  application  in  VSTM.    He*  does,  however, 
hava -considerable  difficulty  wiih  the  aco^tance  of  the  VSTM 
because  of  ""his  association  of  the  objectively  i^eal  with  the  classical 
deterministic  world.    To  reduce  the  student's  philosophical  / 
problems,  there-- is  also  taught  as' an  integral  pa.rt  of  the  course,  . 
the  ACP  (axiomatic  cum  pragmatic)  episteraology .    This  philosophical 
part  oT  VSTM  is,  for  many  students ,  tjne  most  exciting  part 
of  the  course. 

According  to  ACP,  the  set  contain:|.ng  all  man's 
experiences  can  be  subdivided  into  two  subsets:    the  physical' 
subset  and  the  humana  subset.    Regarding  the  physical  subset, 
ACP  claims  the  following:  '        '  .  , 

a.  the  physical  subset  of  man's  experience  consists  solely 
of  observations  and  theories  about  the  obsiprvations, 

b.  a  theory  is  a  deivc.e  for  predicting  observations,  . 


♦ 

* 

c,    a  theory  is  Judged  by  the  diversity  and  accuracy  of  itg  . 
predictions^  ^  ^  ' 

d',    concepts  like  truth-,  reality  and  meaning  play  no  useful 
role  in  theory  evaluation.    Historically/  the  use  of  idealistic 
value  judgements  have  been  counterproductive  for  science, 
^ Although  we  are  primarily  interested  in  the  physical  -subset,  much 
of  the  epistemology  of  the  physical  subset  gets  ultimately 
transferred  into  the  humane  subset. 

In  VSTM  all  theories  of  micromatter  are  formulated 
in  the  six-step^way  (see  Figure  1).    Questions  about  'terms  of 
reality,  truth  and  meaning  are  declared  off  limits.    We  close 
this  section  with  a  few  remarks  about  step  six  of  the  six-step 
v/ay.    Quantum  numbers-  can  be  inferred  from  experiment*.  To 
predict  quantum  numbers  we  must  select  a  Hamiltonian  (step  1) 
which  commutes  with  every  element  of •  some  associative  (Frobenius) 
algebra  or' a ^ non-associative  (Lie)  algebra.     On  the  basis 
of  the  Wigner-Eckart  theorem,  we  then  predict  that  the  observed  ^ 
quantum  numbers  are  the  irrediicible  representations  .of  the  algebra. 
They  are  accordingly  called  algeb-raic  quantum  numbers  *  For 
example,  for  the  hydrogen  atom        «  .  , 

£aH(4}  =>  n,  the  principle  quantum  number 
,^ZClR{3)      ^',m,  the  azimuthal  and  magnetic  quantum  numbers 

^GolJ(?)      s,  the  spin  quantum  number 
For  olementar^  particles 

car>ri(3)  ^  I  (Isospin),  Y  '(hypercharge) 
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The  textbooks  far  VSTM  are: 

F.  A.  Matserii  Vector  Space  and  Algebra  for  Chemistry  ahd  Physics. 

'    .  Holt,  Rlnehart  and  Winston,  1971.  .  '' 

F.  A.  Matsen,  Vector  Space  Theory  "of  Matte^,  .Vol.  I  (Chemistry), 
University  of  Texas  Co-Op  (1970) j  Holt,  Rlnehart 
and  Winston,  (Scheduled  1972). 
F.  A.  Matsen,  Vector  Space  Theory  of  Matter,  Vol.'ll  (Physics), 
Holt,  Rlnehart  and  Winston,  ' (Scheduled' I972) . 
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II.    Con;pu1>er  Usarge  in  VSTM  (with  Robert  Wesson  and  Barbara  Sapienza) 

In  VSTM  the  computer  is  used  in  thre^e  ways: 

a.  the  laboratory  contains  -an  experiment  consisting  of 
simple  Fortran  programming, 

b.  the  eigenvectors  and  eigenvalues  for  large  systems 
are  computed  with  canned  programs, 

c.  CAI. 

An  example  of -CAI  is  a  tutorial  for  the  computation 
of  eigenvectors  and  eigenvalues.    Given  a  matrix  the  computation 


«   is 'carried  o'yt  aVfollov^:''   '  *      i.  '  ,     '  '  ' 

step  1 .    Solve  eigenvalue  problem, 
Step  2.    Write^  eigenvector  equation  in  ratio  form. 
Step  3.    .Solve  for  ratios. 
Step '4.    Normaliae  the  vector. 
*  Th'e  CAI  is  called  a  tutorial  because  it  presupposes  some  previous 

knowledge  on  the  part  of  the  student  which  is  acquired  by  the  ' 
,  so-called, quick  and  dirty  i-^ethod: 

i.    ,a  twenty  minute  lecture  followed  immediately  by  a  quiz, 
_  ii.    ix  second  lecture  over  the, same  material  followed 
immedi'ito ly  by  a  quiz. 

Those  who  scored  perfectly  on  quiz  i  are  excused  from  quiz- ii 
and  v/hatever  follo^^G.    Those  who  scored  perfectly  on  i  or  ii 
are  excused  from  what  follows.    The  remaining  are  required 

to  take  the  CAI " tutorial . '  -         '     .  ' 
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«  »  ' 

A  flow  diagram  for  the  eigenvalue-eigenvector  tutorial 
beginning  with  step  2  is. given  in  Fi-gure  2,  a  typical  print  out 
in  Figure  3  and  a  glossary  in  the  appendix.    The  liiath  text 
to  which  reference  is  made  in  the  tutorial  is  Vector  Spaces- 
and  Algebras.    The  tutorial  employs  as  a  programming  language 
CLIC  (Conversational  Language  of  Instruction  and  Computing) 
which 'is  translated  and  compiled  as  a  Fortran  program  on  the 
UT  CDC6600-6400  system.    The  p^rogram  at  remote  terminal' is 
executed  with  the  tinfe  sharing  language  TAURUS. 

^ Three  interesting  features  have  emerged: 

a.  The  quick  and  dirty  method  solves  many  student  problems 
and  greatly  relieves  the  load  on  CAI  programming  and  the 
computer, 

b.  ■  The  eigenvector-eigenvalue  tutorial  does  moi^e  than  teach 
the  student  eigenv'='ctors' and  eigenvalues.     It  also  teaches  him 
how  to  study  .applied  mathematics.    Students  often  become 
discouraged  when  they  cannot  see  answers  immediately.  The 

CAI  tutorial  slows  them  down  (it  is  verbal)  and  indoctrinates 
them  with  the  necessity,  of  taking  one  step  at  a  time  and  of 
focussing  carefully  on  each  step.    Further,  it  teaches  them  that 
questions  about  truth,  reality  and  meaning  are  not  part  of  the 
c-omputation.    This  lesson  carries  over  and  we  find  CAI  is  less 
no^esG.ury  Later  in  the  course, 

'C.    There  remains  a  hard  core  five  percent  who  do  not 
respond  to  CAI.    Some  of  these  can  be  handled  on  a  personal 
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tutoi'iui  basis  but  others  are'  psychologically  indisposed  toward 
the^-^;^uction  of  scientific  concepts  to  .steps  in  a  computation. 
■  We  try  to\  cycle  these  people  out  of  YSTM  ii)to  a, conventional 
'  cour;^r^y  the  end  of  the  first  week.        '  .        ,    •     '  ' 

In  summary,  VSTM  makes  ail  students  (sciesnce  and 
non-science)  computer  conscious.    In  fact,  it  has  influenced, 
many  students  in  the  direction  of  computer  scieriee..  •  •"*** 


Table  I 

Territory  Accessible  by  the  Six-Step  Wf.y 


Simple  Orbital  Theory 
t 

a.  Atomic  Orbitals 

b.  Huckel  Orbitals 

c.  Ligand^ Field  Orbitals 

a.    Bloch  (Solid  State)'  Orbitals ^ 

e.  Nucleenic  Offetials 

f .  Quark  Orbitals 

g.  Spin  Orbitals    '  v 

Coarse  Structure  Theory 

^  V 

a.  Atoms  — 

b.  -^  Unsaturated  Molecules  . 

c.  Complex  Ioztis 

d.  Solids 

e.  Nuclei  (Supermultiplet  Theory) 

f.  Baryons,  Mesons 

Fine  Structure  Theory 

a.  Fine  and  Hyperfine  Structure 

b.  NMR.  and  ESR 

*c.    Nuclei  (Isospin  Free  Theory) 
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APPENDIX  I 
GLOSSARY 


initialized  in  driver  program  to  10'  in  SUBR1, 
decreased  1  for  each  request  for  explanation  or 


wrong  answer 


initialized  in  driver  program  to  1X)  in  SUBR2, 
decreased  1  for  each  answer  given  wrpng  twice 

internal  control  in  SUBR3,  works  the  same  way  as 
above      •  /       '  - 

a  control  to  see  if  the/studfent  canperform  the  work 
in  the  subroutines •     ly  is  initialized  0  in  the 
driver  program.     If  th6  student  shows  that  he  has 
been  ^unable  to  performf  the  work,  RXKILL  is  set 
at  0  which  'signals  th^  driver  program  to  refer  him 
to' the. proctor  an4  end  the  tutorial.    If  the 
student  has -done  adequately,  RXKILL  is  set  to  1 
which  signals  the  driver  program  to 'continue. 

used  only  in  the"  c^river  program.    Calls  routine 
which  refers  the  student  to  the  proctor  and  ends 
.the  tutorial.    Signalled  by  RXKILL 

correct  answer 

^alternat^ ' correct  answer  ^»  ; 

wrong  answer  .  - 
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SUBROUTINE  SOLRX 
(main  driver  program  for 
steps  II  and  III)  


I 


COMMON  STEP2,  STEP32,  .STEP31,  RXKILL 
RXKILL  =  KILLl  =0 


3 


I 


"lntroductl<3n" 


"Please  note  equation' 3 In '''the'- math 
text.  Look  at  (x^. "     ■  ' 

  I 


6  Pno^ 
8   iSTEP^2  =  10 


"Do  you  see  hoy  "ir  becomes-  3.5. -lO?". 


.STEP2  =  10 


CALL  SUBRl(RXKILLp 


E 


]9 


"Let's  set  some  specific  values 
for  -  th-e  problem  we  sha*Ll  work. 


\ 


Now  let's  discuss  the  significance  o'r  these 
as  they  apply  to  equation  3. 5. 12." 


"You're  trying  hard 
but  not  doing  well 
I'll  give  you  a  sec 
ond  chance . " 


iSTEPjl  =  lOj  15  ,^ 
:ALL  SUBR2  (RXKILL) 


18 


iKILL  = 

'No       ,  Yes" 


CALL  SUBR3 (RXKILL) 


i: 


<0 


"You  did  poorly  in  this 
section,  but  keep  trying 
and  I  will  be  able  to 
help  you,  " 


I 


KILLl  =  0^ 
VRETURN  J 


21 


"You  did  fairly  well 
on  this  section. " 

2k 


*You  did  quite  well  on 
this  section. -.You  are  to 
be  commended. " 


V 


?5 


26 
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28. 


27 

SOLRXGP  =((STEP2  +  STEP31  +  STEP32.)/3)*,10 
"Your  grade  for  this  section  is--"- 
OUTPUT ( SOLRXGP) 

29 

KILLl 

=  1 

30 


^RETURN 


This  subroutine  is  called  with  two  parameters  -  KILLl 
and  SOLRXOP.  If  RTLLl  is  returned  as  zero,  the  student  could  not 
a^fequately -perform  properly  on  either  it  or  any  of  its  subordinate 
subroutines.  He  should  be  referred  to  the  proctor.  If  KILLl  is 
returned  as  1,  .the  student  completed  the  subroutine  properly.  ■ 

The  other" parameter  SOLRXGP  Is  the  floating-point 
p;rade  point  avprae;e  of  the  stud^-nt  on  d  100  point  scale.  .The 
normal  A,  B,  C,  D,  and  F  grades  can  be  assigned  according  to  . 
this  sc&le,  which  should  be  a  normal,  distribution  for  most 
-/^lasses.  -        •  • 


31 


SUBRl(RXKTLL) 


32 


"Multiply  through  the  right  hsnd  side  by 


p  'C  •"Don't" 


understand" 


35 


I 


D 


36 


F.'xplanatlon 


STEP2=STEP2-1.0 


"Now  factor-out  the 


I 


37 


40 


Explanat Ion 

STEP2=STEP2-1.0 

4  3  "Explain"  J) 
45 


Explanat ion 

STEP2=STEP2-1.0 

IRXKILL=0  - 


"Now  .replace  each  term 
^^KA^iV  (^KA^M-^^  R>,KA)" 


C  ."Go"  J) 


44 


"Your  equation  should  now 
be  equation  3.5.10.  Is  it?" 


7 


C"Yes" 

)49 

T  ■ 

RXKILL=1 

52 

;  t  . 

RETURN 

^54 

"nari't  handle  your  proolom 
logout 

n 


''^^  HETURN  ^ 


Sljy-H2(RXKIi.L) 


"W^-^  know  v;^   have  speciric  numerical  values  for 
'      ^V;Ti'^;J>  's.  What  property  of  the  basis 
'.03  Uo  these?" 


"No.  .^'-'^  pa-^'  44. 
ar'i'l  1  fv  a^-'a!  n.  "■ 


UN  C 


I 


-".T  |"TEPM=STFP31-1 


f;   !'a','".    •  ha  L  liie 
basis  is  orthonormal 
tells  us  that 
<V;I|V;J>  =1      if  I=j 
=0     -If  I/J 


^2 


UN  ^ 
m 


Orthonortn 


lal"     1  6 


"Exactly" 


6^ 


"Look  ar,aln  at  ^.^y.l2.  Honstder 
^  he  operator  brackets  of  the  form-' 

<V;T[x|V;J),  W^.^  have  numerical 
values  t*or  these,   loo.  Where  do 
these  values  come  from?"  ^ 


70 


Wron-^.  "R'^rr^-n-b^-r  what  an 
op^'r^av-^r'  nra^'k^-t  means*" 


rA^'*Mat^rix^  72 


-a-  T'! 

^  hr-  ar)r.w^\r,' 

'  '-r  '1-1 

^  ^ 

7h 


"Rirht.   Our*  matrix  ^M  v^^s 
MS  t  hese  re-sul  t  ;3  *  " 


Tn  .^.1?,  we  v/ill  sum  over  t,he 
*oial  number  of*  basis  elements* 
ri^-rv'^v>   th>-  valup  01*  "n"  will  be?" 


•^0 

:  •  n,  ! 

.  ^:air.." 

I 


77 


"norr-.;t.   M=p  since  our  baj  1.3 
1.  ;^-Mirr.(;iis!ona1  .  "  


mi  d 


tt 

^1 
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<0 


'You're  not  relatinc 
properly  >  Logout  >  'V 


92  f 


6  ^RETURN  ^ 


*^Do  you  think  you  know 
what's  going  on?" 


1 


"Let's  continue. 
You're  doing  f Ine^^  " 


I 


rxEllS>j     Y"   t      ^      (return  ) 


93 


97 


"Then  I'll  give 
you  a  second  " 
nhannp." 


•  CAI  EVEC  OUTPUT 


Student  responses  underlined,    Jlow  diagram  box  numbers  in  margin. 


iU}  A/.E  NC^  GGING   iC  f^trNrOKK  oT LK:^  ^,1 1^  AND   111   OF   FhE  EICENvECTOk 
^^^C;bLE^•  Ao  E/r'LM,  1  NED    IN  r^AGEb  9U   -  V3   CF  YOUk   MATH  FEAT. 
NCIE  ECUAl  ICN  .3  .b.7  AND   LOOK  bP EC  1  F IC ALLY  A,l    THE  EXHKESSION 

KA    I  ^  '  _ 

InCTE   FhAT  blNCE   1   CANNCf  ^^K^INT  THE  LO^.Ek-CASE  "K'S    1  A^  FG/<C£i) 
iC^U-^E  The  ''A'*    IN  f^Y   OWN  VEkSICN  CF   THIS   EXPKESSICN..    f^LEASE  00 
^\'CT  L£I    fHAl^  C  jiVrUSE   YCU  . 
^     OC   rCU  o££  HC,  ELUAIION  3  .  b  .7  *  BECOMEb  EGUAT  1  ON '3  .5  . 1 0  fHKOUCH 
:5l^rL£  ALCEbKAIC   KANI/^ JLAT 1  ON?  .  ' 

'TNEN  A  LinLE  LA^^LANAFIGN  IS   IN  OkDEK. 

ALL  KlCHl  .     AN$riME   YCU  DCN*  i    UNDEkSTAAID  A  bFEP  J .  AbK  YOU 
rof^V.AKE/.  TYPE  ''EaPLAIN".     DIHEKWbE*   TYPE  ••CO"   WHEN  YOU'VE  DONE 
iAE-SFEP.     EapANO   EL'UAFICN  3.5.7    (PEkFCkM   FHE  bUMMAFJON  CPEk- 
3<J^AilCN  AinIl)  WKliL  DCWN   IHE   FErcMb  CN  ^OUt<  HAHEk  )  .     NOW.  MULFlPLY 
IHk'CUGH.CN  1  ht  kIGHI-HANU  blOE  BY  f4^1b  EAPKEbSlON  (WHICH 
LCUAL:5   ONE)--  .  j 

.     \J  \J 
vCa    )  .  /   (  a   )  ,      '   ^  '      '    -  . 

K         -         K    M  '  •  ^  , 

I'LL  WAX  I    F  Cn    r  OtJl   ID  DC    FHlb  .  .  .  . 

.^7  NC/i  FACTCk   cut    FHE   (A   ).       LN  FHt  NUKEkAT Crc  .  .  .  • 
I'l^'C  .  r.cr-LAui  cACh   itnK  CF   fHET  FOnK 

j        '      .  (/*    )    /CS^  )  .  "      -  ' 

'"P  )  '  K    I         r<  K     .      ^  ' 

/.vIlVl    frir:   LAMtLoolCN  K  (K;K) 

1       •  ,    ^  ,  '  -  ^ 

MHib    lo*J.»oi    A  ChAiNCE   Ch    NCTAf  ICXv.  .  .  . 

iiAPL^  It\ .  -  *"     ^      ^   .  *'  » 

rcILUAl  i5no  3o.1u   and   J.b.ll    cdHCJLD  CIvE   YCU  ALL    FHE    INFCr. - 

rC'l  NKr:^.      EOUA  (  I  3N  J.b.lu    lb  P^£CIbh.LY   UHAF    YGU^  ECUAIICN 

'•)bHCiTLu  n:.,  l:;::k*like.    ucnm  ^c^/ny  abouf  the  v/alue  of  ^  yef. 

*  V'LEAoc   f-./^Kr.^'l^HL   NECEbb^^Y   LH^^NCEo   NOW  ANb   LE  F  *  b   CCi\ F  I  N JE  .  .  .  .  ' 

..UH  •  tHE  A-C.JE/v -CAbE  ''K'b'*  b  UP  r'K  E:3b  El)  .    *Io  ''5^ 


« 


^uAC  Lf-o   ..c  t'^^vZ  IMC  bTEK    III,   ECJAFICN  J. 5. 12   IS  THE   IKr^OhTANT  ONE. 


fuZ.\''i  6uak^£  you. 


1T\   C/M)Ef^   TC  V.C^f<r  THE  PrcCBLEK  IN  AS  CLEAk  A  WAY  AS  POSSIBLE^ 
L£P:>  i>ti   tiCy^  :>PECIFIC  VALUES  V.E  WANT  TC  USE  THrcOUCHOUT  THI:> 


1)  CJk  BASIS  '/.ILL  8E--  IVM  W  2> 

2)  ^'E  '"ILL  HA\/E   NC  NEED   FCh    fRE  "A"    INDEA   (  LCUEk -CASE  il' K* 
tjCCK)>   bO   I   WILL  SUHPkESS.it. 

J)         =   I      (.YOU'LL  LEAkN  LATEr^  WHY) 

4)     IHE  BASIS  WILL  BE  OKTHONOKKAL. 

b)     THE  SPECIFIC   EXAMPLE  hATKlX  WE  USE  WILL  BE-- 

"3     2  • 


IN"  rcj/^ 


'J 


LEI'.b  DlbCJSS   THE  S  1 CNI  F  I CANCE  ^CF   FHEbE  AS   1  HE  Y  ,  APP  LY .  T^)  EOU^.TION 
J. 5. 12.  ^      -  ^  ^  '       .  ' 

',.E  KNCa   v.e  HAVE  SPECIFIC  NUKEkICAL  VALUES  TOk  ^  fH£^;'*iVi  II 
.Yi^'t^CLo.  UHAI    INi  riALIZlATION  CONDITION  GIV£S  US    FHE^E?.-  - 
/;,:^Or.l  HCNCKt^AL  BASIS  ^  ^       .  .     '  ' 

OS-EAACTLY  •  '     "         '  ^  '        •/  ' 

(LOCK  AGAIN  Al   E:cUAriCN*;K5.12.   CONSIDErc   THE  OPEkA  T  On '  BRACKETS  OF 
(#^STHE  FO^^K  <v;  1  lAl"  Vrd-^-*-^  HAVE  NUMEKICAL  VALUES  F0«  THEK.  f  DC  .  - 

UHEnE  DC  iH£oE  C/V.E  FkOM?.... 
72  GIvEN  lYAr/NlX 

75'  niGHT ;   fiJr  i<'A(A  IX  ClVE^   US.  THESE  hESULfS.* 
07}  IN  3*5.12   >   WE  WILL  SUV  OVEK  THE  TOTAL  NUKdEK   3F  b  A:^  I^S  "£l.€'KENTi'.  * 
WnCE   IHE   VALUE   OF  "N'^  WILL^^BE  1 ^ 

7,0  NO*  CcUhNf    fHP  M'JKbEK    OF  BASIb  ELE^tAJT b  AND    FkV  AGAIN.  .    /  1 

5^./ Tn5r?<c:c r . -\  =<::  since  cun  ba^i^  ii>  2  oikEnsignal.  ^.  •  <  , 

l/ET'S  CCNTIMJE*    Y3U'r\E  DOING   FINE..       ^efyfO  to  u)^A/C/f  r.<:>A;77A.»yc-6 


7-c  5u8c\:J 
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Introducti^on  to  Computers  and  Computing  in  the     ^  * 
Physical  Chemistry  Laboratory 

M.  R.  James  and  W.  G,  Koch.,  Department  of  Chemistry 
Unlvetsity  of  Northern  Colorado 

Greeley,  Colorado     80631  '  '  * 

Abstract:    A  laboratory  course  has  been  developed. which  introduces 
'the  beginning  physical  chemistry  student^  to  computers  and  computing 
by  considering  (1)  how  computers  handle  data,  (2)  basic  informs* 
tion  on  computer  hardware  and  software,  (3)  levels  of  languages 
by  which  programm'ers  can  communicate  with  t:he  computer,  and  (4) 
programming  and  treating  data  with  both  programmable  calculators 
and  computer  systems.    The  course  begins  with  an  Introduction  to  / 
binary  numbers  and  how  digital  computer  operations  are  based  on 
two^^Jiate  devices.    A  brief  introdu9tion  to  computer  hardware  and 

tftwareiNS  then  presented  followed  by  a  consideration  of  the, 
,*app;lication\  of  a  binary  number  system  to  communic-ation  with  the 
•  comj^uter  inVnachine  language.    This  is  then  followed  by  an 
,^inti:ijductici)n\to  assembly  language  with  emphasis  on  how  these  "low 
^  level"  languages  allow  communication  in 'a  Step-by-step  process 
on  err  near  theV^computer  level."    Emphasis  is  also  placed  <Jn  how' 
jiwch^more  difficult  these  "low  level"  languag>es  ^re  to  learn  and 
'  uise 'tliari"  the  "llhigh  level"  languages.  '^Concurrent  \iith.  the  above, 
\insti?jac£ion  and'. experience  in  using  programmable  calculators  is  ^ 
giVeri.    fbllowina  the  above' material,  instruction  in.  programming 
in  a- ^Tiigti  l;evel" ylanguige  may  be  given  depending  on  the  Background 
.;v'o«f  ;'the  .Students  and  cotuptiter  .facilities  available.    The*  latter 
^kill^s  dte  6hen,  applied  during  the  remainder  of  the  yeaj:  to 
.treating  ^xperimeiiiDal  Aata  acquired  in  the  physical  chepiistry 
labof^totr-y. 
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Introducti-on  to  Computexs  and  Computing  ,  in  the 
Physical  Chemistry  Laboratory 

M.  R.  James  and  W.  CKoCli,  Depi^rtment  of  Chemistry 
University  of  Northetn  Colorado  i  . 

Greeley,  Colorado      80631  . 

In.  organizing  i  laboratory  course  iri  physi<:al  chemistry  for  use  ' 
during  the  first  few  weeks  of  the  year,  one' is  faced  with  designing 
lecture  materials  and/or  "experimexits  which  are  meaningful  and  yet 
db  not  require  a  sigaif icant  background  of  knowledge  of  physical 
chemical  principles.    One  way  this  may  be^ achieved-  is  by  iaiflWi^ily 
lecturing  on  such  subjects  as  treatment  of  experimental  ddta^' 
safety  iTv  the  laboratoryj  etc.,  followed  by  performing  fund^rftaT, 
experiments  such  as  Calibration  of  a  Volumetric  Flask,  Gas  Thermo- 
metry, Determ'inatiop  afthe  Molecular  Weight  of  a  Volatile  Sub-  - 
stance  by  the  Dumas  Method,  etc.    An  alternate  approach  which  has 

'b^en  tried  and  found  successful  is  to  utilize  the  laboratory 
during  this  initial  period  to  introduce  the  students  to  how 
computers  operate  and  how  they  may  be  used  in  treating  experimental 
data  pbta^ihecL  in  the  laboratory.^  .  -  -  • 

With  the  increase  in  the  application  of  'computer  techncrlogy  in 
industry  and  with  the  increased  usp  of  computers  in  seconclary 
School  curriculum,  chemistry  majors^^  graduating  with  a  bachelor's  ^ 
degree  under  a  conventional  college  chemistry  program  often  find 
that  they  are  poorly  prepared  to  , fit  int;o  existing  .programs  which 
utilize  comput;ers  or  lack  the  ability  to  initiate  the  ase  of 
computers  in 'their  respective  occupations,   ^In  recognition  of  this 
problem  a  program  has  been  develd^ed  for  use  in  the  physical 
chemistry  laboratory  which  introduce^  the  students  to  the  ba^ic 
^Qoncepts  of  how^computers  operate  and. how  they  and  programmable 
calculator's  may  be  used  tp  .treat  experimental  *data .    This  program- 
provides  ^meaningful  experience  .during  the  first  part  of  the  y^ar  . 
while  the  students  are  acquiring  a  background  in  physical  chemistry 

'  in, the  lecture  portion  of  the  course.  .^Applications  t'o  various 
experiments  performed  in  the  laboratory  and  problems  assigned  in 
ttie  lecture  may  then  be  made  throughout  the,  rest  of  the  year. 

The  laboratory  course  is  outlined  by  the  following  schedule, 
with  appropriate  variations  being  used  depending  oik  the  student's^ 

.background  and  interests.'  ,         <■     .  • 
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.Laboratory  1  -  Bitiary  Number  Systems  and  Keybpard  Programming  of 
Programmable  Calculators  ^  '  , 

I,    Number  Systems:     1  to  1%*  hours. 

A.  Digital  computers  as  two-state  electronic  devices 

B.  Generalized  decimal  system 

C.  Binary  number  systi^m  ...... 

^        D.    Conversion  of  binary'  fa.  de^imial-'fitnnbers  and  vice  versa*^ 

E.  Octal  notation        '  '  ,  * 

F.  ..  Computer  accuracy  andrwprd  size.-v.        <.  _> 

.  *  G.    Binary  arttfimetiC'.  y^'    .  V 

,     ^    .  .1-./  AdditioXi,'/.      "  '         --'j  ' 

2.^vTW,Q.!6-  conjpl^ment  ao^- subtjr^iitfon  ^ ,  .-v'.^w 

;    *  .  3y''*Muljtipy.jP^tipn^^ 

7  .  Subtraction  >        •  i-"!'  '      "  • 

'       '      *  •         '  : '      .  "  •  ' 

II.    Ke'y.b/>aid  Progxjaiitaing-,<^|f' '^^  to  2-hour 

•  :  t^'stryc'lidti  a^^t*'3emou5^^atiqa^of  ba.^*6'qpnce^ts  of '>i  -^  .' 

'*  '         pijog'ramming;iaVaila)>l)e  cal,?i4i.lator(3^^^ 


Mgjebraic^^toup^  :    _       v  -  . 

.„^V*  .Addrifcion,  »&ubtraQ-tl:6n,  .inipj-^ ip lljjWfl ^     djLyision.//':  ' 
.     b .    ^bonei? t fa t iotr./  ]^  * .  '  '      .  /  ' 

•j;^  ^peC'lal  fuitction  .^oupv/*\  n  *^^ 

/  *  .-vf*    R^;£p^ocal.Sv' fac£0.ria^^^  ^V-^^i*" 

l3»    Square  rd6,ts,  .a<Jriarea*^  etcl,    .♦-'•^  v*"^ 
'         c,.  -  ^Trigohp&etri^  ^functions*.,v\  "     ^  ^^-/l^* 

/d%'*  LogaiflthmJ^9;.'functipd4,-'"etc .  ;V,:    "     •  , 
Dfirarast^atipti  o:^ 'ai'rapVe;/np^ic6grammj^g^^  ! /Vf''- 

1.    Obtaining  aVera&^$/  * '  ^     "  •'<*'y-'^'  ' 

"   ■    ,  •  2»    Csalculatidh  df'vcourse  .pjefjCcfentages/V'-^^   '  '  ^  * 
'     Di    Student  pro^rannpitig  of  siMple  proyt^iris       "  .v> 
•Iv;  Summing  a' ^eries.-.otf  nijmber.a*"'  ^  y'- 

.    '2.'  Conversions*  betiW^en  temperature^  sdales    '  »:  * 

tabqratory  1  -  .Comput^er -'Hardware  an^i.'^Softfware  arid  Cavd  or  Tape 
'   % 'Programming  of  Programmable vtJalculators.  jv* 

*    .  •  ^  '  ; .  '  /  • 

I.    Computer  tfar4waret:   '1^  tTo  2  hours         /  .  . ''^ 

.  A.    Characteristic's  of  general-tmrpose  digital  comptiterVv^ 

B..  Basic -elements  of, -a  cbm^niter  \       .  . 
,   '      !•   Jirput  aud^^dutput  S:^ctions  '  -  • 

2:    Arithmetic  unit /   '       .  •   •      .     ^  , 

3.   .Memcrfy*  \  , '       .  »  ^ 

\/  \   4.    Control  unit  ... ... '  '  '  *  'i;  '  ' 

♦  :>         '5^-  -Irypes  of,  words  ' 
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C.  Basic  machine  operation 

1.  Fetch  phase 

2.  Execute  phase 

D.  Core  memory 

E.  Computer  instruction  set 

1 .  Memory  re f erence  inetruc Won 

2.  Regist;er  reference  instruction 

3.  Input-Output  instructions • 

F.  Input-Output  Systems 

1.  Teleprinter  input/output 

2.  High-speed  punched  tape  input       •  '  ,  - 
3!    Card  readers 

4.  High-speed  punched  tape  output 

5.  Magnetic,  tape  input/output  and  other  peripheral 
devices  where  appropriate 

Cpmputef  Software 

A.  Classes  of  software 

U    Translators  \  ♦ 

a.  Assemblers  . 
*    b •  ^  Compliers 

B.  Control  systems 
C-    Utility  routines 
D.    Applications  programs 

" '       '  '  '  .  '        '  ' 

Card  or  Tape  Programming  of  Prog'riimm^ble  Calculators:     1  to  1^ 
hours        .    •   /      '  ,         '  .    .  ' 

'a.    Instruction  and  demonstration^of  programming  the  calculator 
using  cards  or  tapes 

1.  Preparing  the  card  or  tape  .  '  '  • 

2.  Entering  the  program  into  the  calculator  '  ^ 
B.    Storage'  control  unit  « 

1.  Storing  and  recalling* data 

2.  Special  storage  registers 

a*    Simple  accumulating .registers  '         '  ^ 

b.  Statistical  summations,  6tc. 
,C.''  Student  programming  using  cards  ot  tapes- 

1.  '  Preparation  of  cards  or  tapes  for  programs  assigned  in 

laboratory  I  • 

2.  ^    Apjplication  of  technique?  to  original  problems  .of  .the 

^'student's  chocs ihg 


9 


37k 
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Laboratory  3  -  Machine  and  Assembly  Languages  as  a  Means  of  ' 
"Conmunfcating"  with  the  Computer  and  Advanced 
'  Programming  of  Programmable  Calculators 

* 

I.    Machine  * and  Assembly  Languages:     1%  to  2  hours 

A.  Review  of  binary  number  system  and  the  means  by  which 
computers  utilize  this  system  * 

1,    Demonstration  of  data  numbers  in  machine  language'- 
2»    Memory  locatiohs  represented  ip  machine  language'       ^  « 
3.     ExampTes  of  types  of  instructions  represeiited  in 
machine  language  . 

B.  Sample  program  in  machine  language  ^        *   "  ' 
1.    Development  .of  program  to  ad^  two 'numbers* and  Subtract 

a  third  ^  »  i^..  '  '     ,      .  * 

Storing  the  data  and  program  in  savory    ^  "  V 

.  :■  3.    Executing  the  machine  language  Jirogram  . 

C.  Sampi-e  program  in  assembly  language 

1..  .  Translation  of  *the  >^bove  prog'ram^nt6  assembly     ..  ' 
.       larrjgu'age  usiijg  appropriate  computer  mnemonics 
,    2.    Storing  the.- ilata  "and  program  in 'memory  '  ./ 

..  '♦^■•'3.    Executing  the  assembly.,  language  program' 
.  :^P,    Review  of  translators  and 'the  use  of  "high  level"  language 
in  communicating  with' the  computer       '  ' 

1.  -^  ^Diffioqlty  of' p^pgramratng  in  machirie  and  assembly 

;  .*  ^   'languages  ,        .  ;       ■  :  .-'^       .     ,  ^ 

2.  Concept*  of  **higfii*leve.i"  la^h^uages        1  •  ; 
•3.    ILlustrat?'ion  of "  hoV  the  afioye  pirfigram  ^SfbuTd  i?e  ^ 

,  written  in  an  appropriate -l^high  level"  laitgq:dge 

ir.    Advanced  Prpgrammitjg  of  PrDgTammable 'C'alculators:'    L  to.l%  hourf 

A.  Practice  in '^programming  mathematical  probl^s-       \  - 
^          '  1.     Sine  af. an  angle  given  in  radians  • 

2.    Real\,rpots  of     ^yadratic  equation,  etc, 

B.  Application  of  programming  to  calculations  in  physic^ai 
»     "chemistry      i^-'.    *      ":    .  ^ 

'1.    Boltzmanri  distributioh^of  particles  in  "a  giVen  set 
•  (  'bf  energy  levels  >     v;' '        /   »         . •  . 
2.    Heat  bf  cdmbxistion  foi"  bomb  calorimeters,- e6c"  "  '  ;^ 

Laboratory  4,  etc.  r  Appropriate . Instruction  ^d'"Appy;pati^n-.ip 

*    .Programming  "a  Computer  ;in  ,aj.UHi^"'-teve^l.l|f'*  v> 
Language:  '  Number  of '^^6  p^r^odsh^dep^hds  on;..'"' 
.        f^he  background  of  tlfe*'*ejt;iJ<iBjits/;* 
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The  language  chosen  depends  on  the  background  of  the  students 
and  on  the  computer  facilities  available.    If  a  simple  language 
such  as  BASIC  is  chosen  then  a  fairly  sophisticated  degree  of 
programming  skills  can  be  acquired  by  the  better  students  in  a 
relatively  short  period  of  time.    For  the  case  where  a  relatively 
difficult  language  such  as  FORTRAN  IV  must  He .used,  no  attempt  is 
made  in  the  laboratory  to  give  detailed  instruction  in  this 
language.    Rather,  a  brief  exposure  (no  more  than  3  laboratory 
periods)  is  given  and  the  students  are  encouraged  to  enroll  in  an 
existing  course  in  FORTRAN  IV  sometime  during  the  year  ^aiW  then  . 
to  apply  this  knowledge,  in  their 'laboratory  and  Course  work. 

In  addition  to  the  rationale  given  earlier  it  is  felt  th^  above  > 
approach  is  beneficial  to  the  students  from  several  other  points 
of  view.    One  goal-  of  the  course  is  to  prepare  the  student  to 
treat  data  in  a  programmed  fashion  in  a-bi-level  approach:  (1)- 
progranmpable  calculators  and  (2)  full-scale  computers.    The  material 
onvial^lators  was  included  because  of  the  immediate  applications 
in  physical  chemistry  and  also  they  may  be  t\\e  only  programming 
tool  financially  within  reach  of  some  of  the  students  after,  they 
leave  Sohool.^   This  is  particularly  true  for  those  etoplo>^ed  by 
small  schools'  or  businesses.    The  matetial  on  computers  was  prepared 
with  the  goal  in  mind  of  giving  the  students  a  broader  exposure 
than  is  typically  available  in  the  standard  course  in  programming,  ^ 

For  the  student  who  is,  a  teaching  major  the  course  is  designed 
so  as  to  enable  him  to  work' with  students  who  are  exposed  to 
computer  programming  in  their  high  school  math  courses-and  also  to 
give  him  a  more  fundamental  knowledge  of  computer  technology 
which  will  assist  him  in  giving  the  students  a  greater  breadth  of 
knowledge*  about-  computers  in  general. 

For  Che  industrial  or  research  chemist,  the  course  not  only 
provides  an  introduction  to  the 'treatment  of  dath  by  computer.^ 
techniques  but  also  gives  them  an  introduction  to  some  of  the- 
basic  knowledge  necessary  for  programming'  computers  for  oh- line 
data  acquisition  fcom  Chemical  instrumentation;  ^ 

In  operating  such  a  course  a  great  deal  .of  printed  matter  is 
often  ^vail>able  from  cbmputet  firms,  local  telephone  companies, 
etc:  '  These  can  usually  be  obtained  either  free  of  charge  or  at  ^ 
a  minimal  *cost. 
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We  began  implementation  of' our  '^Distributed' Computer  Network"  * 

r  .  ^ 

in  early  1969  through* generous  NIH  funding.    Phase  I  of  this  pro- 
^ram  involved  acquisition  of  three  laboratory  computers  with  charac- 
teristics similar  to  those  presented  in  the  paper  read,  by  Dr.  Wilkins^ 
After  acquisition  of  the",  system,  our  first  task  was  to  begin  an, 
educational  program  at  ail  levels  to. familiarize  our  undergraduate, 
^  grkduape, students -and  faculty  With  the  mechanisms  by  which  the  com- 
puters  could  be  used  to  augment,  on-going  re^^earch  pralects.  An' 
adequate  test-laboratory  manual  to  support  sucK^.  a, program  Could  not 
be  found  ^o  a  collaborative  effort  to  d'6v'elop  the  necessary  teaching 
.matferia-ls  was  initiated  betv^een  the '^m:hors  df  'this  piper.  "A  setious.' 
effort'  in  this  -dir^sction  has  been  made  possible  through,  the  generous 
financial  support  *Sxf  the  National  Science  Foundation. 

In*the  initial  Vstages  of  the  program,  we  discovered  that  oyer 
95%  of  the  small  computejp  machine  time  was  being  used  for  the  editing  / 
of  paper  tapes  and  assembly  of  programs  from  paper  tape.    This  left    '  ' 
little  time  for  program  execution,  and  greatly. teduced  mean  time  / 
betwe,en  failures  for  our  teletypes  since  activities  of  the  abote'type 


require  extensive  bard  copy.    Part  of  the  teletype  problem  was 
salved  by  writing  an. oscilloscope  based  interactive  editor  prpgraa^ 
but  the  tine  element  still  was  unfavorably  weighted  towards  program- 
development  and;awjay  froQ  experiment  development* 

W0  then  developed  an  assembler  program  that  would  function  with- 
in  the  Univerijity  Supported  computer  system,  but  would  produce  code 
that  could  be  executed  u^ing  the  laboratory  computers*    The  disadvai[i- 
tage  in  that  some  of  the  small  computer  *^ands  on**  experience  has 
been  transferred  from  editing  assembly  to  only  actual  execution  is 
far  outwei^h^d  by  the  facts  that  programs  ^re  punched  on  easy  to  edit^ 
card  decks  and  that  all  stji.dents  have  access  to  .the  assembler  it  all 
times*'  .To.  augment  the  power  of .  our  "off  line"  assembler,  we  haye 
developed  a  simulator  (written;-  in  FORTRAN^!  that  can  utilise  a  large 
computer  to  debug  certai^n  kinds  of  errors  in  the  small  machines, 
programs.    At  least  one  commercial  timesharing  house  offers  a  4i^lar 
assembler-sijsuXator  pack«age  that  operates  to^^ak^'  ai^iy  terminal 
attached.  tc|S  the  syStem  appear  as  though  it  is  attuch^a  to  a  small 
computer*    It  is,  of  icourse,  difficult  to  simulate  the  random  actions 
of  a  rill  time  environment*    However,  now  that  thle  lengthy  process' 
of  assembly  is  removbd  from  the  smaller  system,  much  more  time  is 
available  ftor  progifam  and  hardware  testing.    To  facilitate  the^ 
fatter,  we, have  written  an  interpre'ti>^e 'exeqution  package  that  allows 
for  complete  tracing,  of  program  execution,  .and  provides  the  students 
with  debugiiig  information  that  would  be  difficult  to  obtain  in  any 


other  way.  "A  program  is 'executed  while  the  step-by-step  status  of 
the  snail  machine  is  provided  to  the  student  in  hard  copy.  Operation 
of  programs  ijsing  the  interpreter  has  greatly  enhanced  the  student's 
understanding  of  the  small  machines'  instruction  set. 

,  The  type'  of  hardware  described  in  the  previous  paper  has  proven 
to  be  ideally  suited  for  interfacing  experimental  apparatus  to  the 
computer  in  both  the  research  and  educational  environment. 

The  isolation  provided  by  our  optically  coupled  analog-to  digital 
converters  and  the  pfogranned  gain  features  of  the  front  end  have 
prpven  invaluable  to  the  interfacing  of  our  equipment,  since  tygically 
no  special  grounding  considerations  v^re  incorporated  into  the  design 
of  the  instruments  the  students  have  available  for  interfacing. 

Some  of  the  experiments  are  performed  using  ready  made  interfaces 
and  already  written  programs •    These  generally ^iny>lve  some  particular 
ly,  complex  problem  such  as  simufation  of  Nuclear  Magnetic  Resonance 
or  Electron  Spin  Resonance  Spectra.    The  educational  value  of  such 

.    '.'        ■  •       •     ■  7 

simulations  has  long  been  recognized,  but  in  most  instances'  only 
l/arge  batch  type  computers  are  used  where  the  turn  around  ti-me  may 
be  hours •  Using  a  4K-16  bit  computer  equipped  with  only  a  tWetype, 
and  DAC,  it  is  possible  to  simulate  5"-spin  Nuclear  magnetic  Respnance 
spectra  and  have  the  results  plotted  instantly  on  an;^  stand^ird 
voltage-Vs-time  recorder*.   Programs  of  this  type  have  been  utilized 
in  both  our  qualitative  organic  anarlysis  and  physical  chemistry 
laboratory  courses.    In.  tfje  former,  students  are  given  unknown 
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compounds  with  complex  but'interpretable  NMR  spectta  whi;ch  when 
properly  analyzed,  could  give  the  student  helpful  structural 
information.    The  student  determines  the  NMR  spectrum  of  his  un- 
known  and  then  using  the  simulator,  interprets  the  spectrum  in  terms 
of  coupling  constants  and  chemical  shifts,  which  in  turn^he  interprets 
in  terms  of  his  proposed  structural  assignment.    Students  in* Physical 
Chemistry  Laboratory  perform  similar  experiments,  excejit  here  thie 
design  is  to  teach  something  about  the  quantum  chemistry  oft.'the  tech- 
nique.   One  experiment  involves  observing  Changes  in  NMR  Zlne  pQSitions 
and  intensities  due  to ^changes  in  solvent  polarity.    The  changes  arise 
due  to  induced  degeneracy  of  chemical  shifts  causing  normally  forbidden 
lines  to  become  allowed  and  vice  versa.'  In  fa,ct,  it  is  po'ssible  to 
determine  the  relative  signs  of  coupling  constants  between  some  of  the 
protons  in  the  -compounds  used.    Of  course,  interpretation  of  the  data 
requires  full  calculation  of  several  spectra,  and  when  the  studlsnt  is  , 
able  to  immediately  observe  the  e/fects  of  modifying  one  parameter  of 
another.,  his  understanding  of  the  physical  process  involved  is  greatly 
enhanced,  and  the  time  required  j^or  the  experiment  is  reduced  over 
that  if  the  simulations  were  performed  using, a.  batch  computer;  Ex- 
periments  have  „also  been  performed  utilising  electron  paramagnetic 
resonance  spectroscopy  simulation  VjriLth  similarly  encouraging  results. 
Besides  the  chemistry  taught  by  these  experiments,  the  ^[hands-onV 
.aspects  add  to  the  students  understanding  of  computer  techno'logy. 

Another  category  6f  exi^ejiments  thatjjp'lps  to  introduce  students 


6-58 


to  digital  data  acquisition  makes  use  of  the  general  interface  and 
a  high  level  language  for  data  acquisition  and  data  reduction.  The 
languag€is  used  in  our'  case  are  FORTRAN,  BASIC,. and  CLASS,  but  simi- 
lar  languages  are  aviilable  for  us  with  most  small  computers.  By 
nature,^ these  systems  are  limited  in  acquisition  rates  and  reil  %ime 
computational  ability,  b,ut  a  great  bulk  of  the  experiments  performed 
in  undergraduate  laboratories  are  slow  and  relatively  undemanding . in 
control  requirements'.    Examples  of  such  experiments  are  spectro- 
photometrically  monitored  kinetic  experiments,  automatic  spectro- 
photometer mixture  analyses,  microdensitometer  control,  vapor  phase 
cjiromatography  experiments,  and  several  other  experiments  that  pro- 
duce data  as  some  regular  function  of' time  or  voltage.  Examples 
of  such  programs  are  available  from  the  author  on  request.    The  main 
advantage  of  this  or  any  other  high  Itrbl  language  is  that  students 
c^  perform  useful  experim'e^nts  without  having  to  learn  ^rauch  about 
either  the  assembler  language  of  the  computer  oi\th^^ 
characteristics  of  the  intetface*    This  means  that  one  or  two  ex- 
periments that  utilize  the  computer  can  be  made  to  be  patrt  of  an 
existing  course  without  completely  revamping  the  emphasis  and  basic 
course  context.    Another  important  advantage  of  starting  with  a  high 
level  language  is  that  the  "activation  energy"  of  actually  learning 
'how  to  use/  a  computer  is  greatly  reduced  over  that  imposed  if  the 
Student  must  first  learn  an  assembly  language  and  interface  elec- 

f  1  1 

tronics.  '  It  his  been  our  experience  that  exposure  of  the  above  type 
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actually  serves  to  stinwlati  interest  in  the.  more  b^sic  details  of. 

.  program  generation  and  instrument  in'terfacing. 

^  A  variety  of  other  approaches  have  been  explored, in  oui^labora- 
tories.  and  perhaps  the  above  will  be  replaced  in  the  near  future 
with  some  other  Whing  technique.    This  term  approximately  40 
students  in  a  variety  of  disciplines  are  participatiiig  in  one  Way 
or  another  in  our  computer  educational  program. 

In  conclusion,  we  wish  tp^ote  that", even  though  this  progrwi 

•  is  still  in  its  developmental  stag^  nearly  all  of  the.  plan  out- 
/lines  in  papir  I  have  proven  to  be  workable,  and  after  another  /ear 
we  hope  to  be  able  to  report  the  dmplementation  of  a  fully  debugged 
undergraduate  teaching  curriculum.        '  ^  " 

Acknowledgements:  ^  \ 

Tift  authors  wish  to  acknowledge  the  geirerous'  support  of  the. 
National  Science  Foundation  through  grants  6J-393  and  GJ  44.1..    '  ' 


6-60 


CHEMISTRY^  COMPUTERS  &  GRAPHIC  ItlTERACf  lOr 
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It  Is  not    very    suprlsing    that    Chemistry,,  whose' 
roots  are  located  deep  In  Alchemy,  should  retain  a  high 
degfee    of    graphic  parltlallty.    Words  to  the  alchemist 
served  mor^  to  confuse  than  tQ  cl a r  I  fy    wh  1 1  e    many  of 
-  the  •  dlagrarps  •  of-   chemical     apparatus    Which  appear  In 
these  old  manuscripts  (20)  would    be    acceptable     In  a 
modern     text,  on  Chemistry*    Throughout  I  ts  .development 
*.   Chemistry  has  depende"^d  on  the  graphic  Image*  to  clarify 
,  concepts    (16)*,    and  *  to  c^reate  theoretical  n)odels  (9)« 
In  some  cases  graphic  devices  were  necessary  before  the 
results  of   'experiments    could    be    clearly  understood 
(12, 1*2).      Thus,     the.  modern    cKemlst,     In    using  the 
computer,    has    been    hanjfl Icapped    because    he  tries  tp 
commuj^lcate  In   ^    exclusively    te^itugl     mpde;  ''Ideas 
formulated    pictorlally,    whethe-r"  In  the  Chemist's  mind 
or  on  paper,  are  most  easily    communicated  graphically 

•   (17).  .    •  -  -  m 

The  advantage  of. clear,  visible  displays    has  ^ong 

^  been     recognized    by    those    who    were     responsible  for 
demonstrating  experiments  that  were  not  visible,  to  thore 
than    a     few    people      at      a      time      or  effectively 
communicating    with     large    audience^.    .  Very  Ingenious 
methods    have    been    developed*     for      Increasing  the. 
graphical     visibility  of  such  experiments  (19^,  as  wel 1^ 
as  effectively,   exploiting     the    advantages  .  of  'modern 
audla-visual     equipment,  (2).        Ther^  Is  no  reason  Why 
these  techniques  should  npt  be  exploited,     rathe/  tharw 
Ignored,    when     the  chemist  uses  .the.  computer  In  either 

^     education*  or  research* 

It  would  be  helpful  at  this  ppint  to  cmflne  the 
term  "Computer  Graphics".  "We  reffer*to^the  ability  of 
the  computer  to  display  line  drawings.  Shaded*  pictures 
and  graphs,  under  computer  program  control.  These 
Images  may  appear  on  the  face  of  an  electron  ^  tube,  on 
film,  on  paper  or  ^ny  other  type  of  materliil  which' Is 
user  readable.    We  also  classify  as  "Compubfir  (graphics" 

*  the  study  and  use  of  devices  for  the/lTiput  of  .graphical 
datd  to  the  computer,  such  as  digltaT.  scanners,*  light 

*  pens.  Rand  tablets,  joysticks,  6tc. 

There  Is  no  good  reason  why  the  chemist  has  not 
made,  better  use  pf^thls  method  of  communication^  In" 
'  1963,  Ivan  Sutherland  ( U8)  and  his  col  1 eagues,  des Igned 
and 'Implemented  a.  system  that  vyas  the  first  significant 
demonstration  of  the  use  of  a  graphic  display  console 
as  an  Interactive  Input/output  device.  '  This  system, 
caHed  SKETCHPAD,  was  designed  for  applications  related 

,  ■   ^  385" .  .  ■ 
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to  computer-aided  <iesign,  however  it  efCectivel/  showed 
the  ut'Uity  of  such  a  device  for  a  wide  range,  of 
computeV^or iented  problems.  The  graphic^  console  had 
the  dual  fhnction.  of  serving  ^s  an  iftput/output  device 
for  'pictorial  and  aphanumpric  da'^ta,  as  well  as 
contfQl  fng  th?  ^  sequence  operation  df  the  computer 
program.  TKdadvan  tag.es  and  past  accompl  ishments  of 
such,  devices  are,  what  we  will  exp'lore  in  this  paper. 
Hpwevpr,  before  we  examine  the  advantages  or 
"Interactive  Graphics"  we  should  explore  the  field  of 
"PBSSjve  Computer  Graphics^'. 

fhose  Input  devices  which  operate  on  static  types' 
of  data>  and  .those  display  devices  which  generate 
irreversible  imag4s'(once  a  line  has. been  drawn,  it  can 
not  be-  ei'ased),  are  classified  as  "Passive  Graphic 
Dev  ices".  These  dev  ices,  of  wh  i.ch  1  ine  prTnters; 
plotters  aad  jDhotograph  ic  .devices^  are  representative, 
are  "al  so  *refered  to  as  "hard-copy"  output  devices. 

'  Many-  computer  users  have  discovered  that  the  J  ine 
printer  can  be  *  very  effectively  used  as 'a  plotter 
(10,39, .  While  %he  precision  is  limited  by  the  size 
of  a  print- Qhara'cter, ;  this  dev  ice' produces  plots  that 
are  crude,  but^  adaqua  te*  for  such  thfn&s  as  histograms, 
1  ine  spectra^  scatter  ^diagrams,-  and  ^s  masks  ^  for  hand 
dray^  con-tour  d.iagVams ^-        "  . 

Digi'tal  incremental  plotters  ^llow  the  user  to 
employ  commands  that  control  thfe  movement  of  a  pen  over 
a  sheet'  of  paper  In  any  one  of  eight  different 
directions"  while  •the  pen  is  either  raised  abovie  or 
touching  thd'paper  surface.  Software  that  uses  these 
"incremental"  .commrands"  to  generate  app  rox  ima  t  ion  s.  <o 
stra  igh\  .lines  .(vectors) ,  is  all  that  is  reall/  needed 
to  use  such'  a  system  (5>35).  However,  in  practice, 
morie  sophisticated  software^  is  usually  required 
(231,52).  The^ze  of  the  smallest  possible  incremental 
movfarnent,  an  i  t;efn  that  determines  the  precision  of 
pi  dtted  .output,  .is  a  major  fac  tor,  in  the-  cost  of  the 
plOtl^er.'  Flat-bed  plotters  ^re  usually  more  exqensive 
thgn  the  cyl  indrical  type  and  ban  often  handle*  larger 
Vlo^ting  surfaces^  '  Remote  plotting  terminal^  are  now 
available  wh-ich.  can  be'placed  at  the  same  location  as 
an'  in  te raci  i  ve  ^al  phanume r  i  c  . te rmi  nal  to  p rov  i  de  a 
hifghly  satisfactory,  r^ore  interactive  type  of  graphic 
display.  "  •  ' 

The  final  type  of  passive  graphic  display  is  ^one 
tjna  t'  uses  a  camera  and  f  i  1  m'^^assembl  y  to  record  the 
yi'^p)ay..  *The'  major  devices  in  this  class*  are  units 
designed  for  miorof.Mm  output,  tha^t  wj  1  1  also  accept 
^6mhi,  35mm,  70m'm^or  lOSmm  film.  These  units,  called 
Computer.  Output  Microfilm  (COM)  devices,  are  new,  quite, 
expensive  and  orVly  available  at  a  few  Installations 
(11,26, 3t+, 37,^^5)  ^  However,  many  of  these  unit?  operate 
off-line  fronp  a  computer,  accepting  theiV  commands  fron^ 
a  .computer  generated  magnetic  tape.  •  ^  Thus,  at 
instal  la  t  i.ons    where     such    devices    are  not  ava  j  1  able. 
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tapes  could  be  generated  .  using  standardized  software' 
and  then  printed  at  a  service  bureau  type  .organ  Izat  lorr. 
Such  a  service.  Is*  now'  being  created  at  Brooklyn 
Polytechnic  Institute  (6).. 

Among  new  hard-copy  devices  that  will  *  probably  'of 
their  ability  to  pr^o/Ide  a  more 'effective  man-machine 
IhterfBce.'  They  communicate  In  -graphic  ^  mode^  -  a 
universal  shorthand,  language  that  can  -be  used  to 
compactly  represent  the  ^ery  1  a  rge^  vol  ume%  of 
Information  that  the  computer  Is  capable  of  generating. 
However^  they  operate  ,In  what  Is;  essentially'  a  batch 
environment/  with  the  usual  l9ng  delay  be tween • st Imul us 
and  response.  '  Interac 1 1  ve * ' graph tc  devices  allow^for 
piore  natural  communication  with  the 'compute  r\y.  where  the 
uier  can  hajidle  those  aspects'-of  ^.the  problem  where 'gqod 
computer  algorl  thms  do  not  je^xist.  These**  Interactive 
devices  exist  In  tf^ree  different  conf Uurat lonS/  w\th  a 
wide  range  of  system  response  '  j»haracterl^tlcs  -and 
pricfes.  .  The  configurations  start  with  the  least 
expensive/  consisting  of  a stand-alone  dedicated 
computer  '  that  shares  its  ^  memory  with  the  .graphic 
dJsplay  unit.  -  The  Intermediate  o^pnf  igurat  Ion  Is  one  In, 
which  a  seperate  buffer  ^s  available  to  handle  the' 
d'Isplay  Information.  The  |nfost  expensive/  highest 
performance  system  consists  of.§  large  computer  In  a 
network  with  several  smaHter  computers/  where  the 
satellite  'computers  proylofe^^thfe  display ,  ^-formatting 
function/  and  the  large  ^computer  provides  the 
1  arge-scale/  ^oph  Is  t  I-ca ted  numerical  ca'lculat  lon^  and 
bulk  secondary  storage  capability..  This  Is  j  the  best 
perfopiance  system/  wl  th  q^lck..  response  to  highl  y, 
complicated  requests.  All  ifof  these  systems  are 
presently  qui-ts  expensive^  but  with  some  modification 
they  can  provide  a  high  degree  of  ^graphic  Interaction 
at  ^'n^inlmal  cost.  The  statement/  Often  made  In  the 
past^.  Chat  most  users  coul<l  not  afford  computer 
graphics  be'cause  •  t+ie  equfpment  was  too  expensive  and 
the  software  too  compricated  is  really    no  longer 

true^  and  the  converse,  is  probably  more  correct,  that 
one  can'i  afford  not  to 'use  computer  graphics,  just  as 
one  can/t  afford  not  to  use  the  compAJter. 

The  most  sophisticated  of  the  three  types  of 
interactive  display  devices  is  the  Cathode  Ray  -Tube 
(CRT)/*  where  under  computer  control  an  electron  beam 
traces  the  displayed  iiT\age  on  a  light  emitting 
phospher,  ,  rewriting  the  imag-e  at  a*  rate  tKat  will 
maintal.n  the  light  level. ^  This  operation  known  a^ 
"refresh  ing"  _  must  be  performed  from  15  to  6Q  times  per 
•second  In  order  to  maintain  a,  •  "f  1  icke  r-f  ree"  display. 
This  constant  regeneration  of  the  Im'age  on  the  tube 
face  Is  the  reason  for  the  high  demand  of  the  ,^splay 
on  computer  resources,  as  well"  as  the  reason  why  one 
can  create  dynamic  Ima<jes  on  the  tube  face.^  Units 
based  on  this  type  of  system  are  still  quite  expensive, 
but    where    available  they  have  proven  cjulte  productI>^e 
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(l,7,2U,!28,29,Ua,U7,5fi)      ^     ,    ^  •    .  ^ 

A  second  type  of  display  unJt  Is    the    Direct  View., 

•Storage  Tube  (DVST)/  where  the  unit  Internally  mantalns 
.the  Image  p)ace^  on  the  faoe  of  -the  tube  far  many 
hours.  This,  typ^  of  display  therefore  requt,res"^  much 
.  Ifess  'of  the  computer's  resources^'  and  can  therefore  ^ 
share  ,a  comOUtaP  among  many  such*  'graphic  displays 
The  graphic  Image  Is  not  -as  dynamic*  as  wlthj 
theiCRT'^  since  with  most  tubes  the  complete  Image  must"^ 
be  e rased  before  a  1  l ne  may '  *  be  moved.  ,  Th  I s  Is 
gradually  *  changing  ^  as  tiew  storage  devices-  are 
developed/  among  which  Is  the  "Plasma  Panel^.'  created  As 
part  of  the  University  of  llllnols's  Computer  Assisted 

'  Instruction  Project^'  Project  PLATO  (U).*  Both  the  CRT 
^nd  the  DVST  are-^calle'd  "Qal  1  Igraph  tc"  or  "Random  Sca-n" 
display  devices^  because  the  -writing  element  ^moves 
across  the  face  of 'the  screen.  In  a^  random  fashion  -under 

^program^  control.  *.  The  third  type  of  display  device 
operates  In  a  different  manner.      .  i  ^ 

Thfs  devl'Ce^ls  cal  led  a  "Raster  Scan"    display  ^  and 
operates,     on      the      same    principles    as    the  common, 
tel  evjs  loQ.  consul  e ;  ^  This  unit'  must  also  be  refreshed^ 
but  It  will  cover  every  spot  on  the  tube -face  with  ^ach  - 
sweep*      Tlie  Information  which  must  be  *  suppl  l.ett  to  this 
type  of  display  device  Is    whether    the    electron     beam  ^ 
should    l?e    on    or.    off  at  any  point  In  the  scan.  This 
device^  ■  while  ^  requiring      a      wider      bandwidth  of 
Inforntet  I  on      to      suppVr^t.     the     refresh     rate^^  takers 
advantage  of^a  mass  produced  technology  that  provides  ^a 
display  device  .(a  television  set)  at  a  very    low  price 
(32).   ■  Jhese    devices    can^  also  take  advarttage  of  the 
cl  osed  -  c  I  rcul  t     teT     I'S'ton  .  systems  "that   *are  being 
developed    ah*d    used    on    many    college    and  university 

'  campuses  (22/56f).  *  ^ 

The  last  two  classes  of    devices    provide'  computer. 

^^raphfcs    at    a    very    .reasonable  price.     Both,  types  of 

'units  can  proylde  Inexpensive  storage  of    the  computer 

.generated  tntage  within  the  display  device^  permitting  a 
single    computer    to  drive  many'graphic  consoles."  -What 

-^"then  afe  some  of  the  possible  applications  .  of  these 
graphic  dlspl'ay  devices  to  cheml.cal  probl  ems?  The 
easiest  ansrwer  Is  to  observe  some  of  what  has  already 
been  done;.-* 

^    In     the    area"   of    Molecular    Model     Building  ^ 
Man  I  pul  at  Ion     sTome'of  the  most  gerreral  graphic  handling 
systems  have  be^n  devised*    Thl-s  Is  an    area     In  which 
all  'branches  of  Chemistry  have  a  common  Interest.  Much 
of    our    understanding    of  chemical  pr Inc  Ipl es' has  been 
ot)talned  through  the  creation  of     Intricate    models  of^ 
chemloal     systems    and  we  have  often  found  ft  necessa^f^ 
to -bulKk  elaborate    devices     In.  order    to    obtain  the. 
Information  ,we  dfeslred  from  the  available  models  (31) • 
.   •Still  maiTV  .  chemists    find"    fault    with     the  available 
'-models    (U3")    and  create  new  more  Intrtca^te  madefs  that, 
will   display  the  Information  chey  desire.    The  computer 
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provides  a  bseful  facHIty  /or  building  models  bebauSe 
mdst  of  the  available  computer  graphics  systems  contain 
commaiids  which  are  ve/y  similar  'to  the  type  of 
opera t Ions,  wh I ch  one  would  J  Ike  to  perform,  on  molecule's 
(rota^te,  t ransl a te /  *move  polri^^  etc.);  ^  Systems  have 
been  designed  -to  assist  tn^the  synthesis  of  complex 
organic  mplecules,  through  their  computer  ^graphite 
manipulation  (7^5S.),  and  others  to  help  *ln  the 
understanding  and  construction  ^  of  large  biochemical 
models  '(28/29/42).  Such  systems  allow  the  user  to 
control  the  complexity  of,  the  system  while,  retaining 
the  full  mathematical  model    Internajly  In  the  computer. 

In  a  slml  1  ar  inanner/     computer    graph  tcs    has  -been 
used    to    aid  the     Identification  of  molecules  for 

computerized  Information  Retrieval  systems.'  Graph  l.c 
units  "Have  been  us^d  both  to  aid  tn  the.  Input  (2l*)"*and 
display  (5^*)  of  chemical  ^  data.  In  terra.s  of  the 
ellmlnatl6n  of  user  errors^  such  ^  Interfaces  are 
unmatched.  ,  •  ' 

The^most  Interest Ing 'of  the  various  graphic  systems- 
are  those /Invol ved'  In  the  Simulation  of  the,  real  world. 
Such. a  system  can  be  used  to  either  teach  or  validate  a 
chemical  theory  tinder  Ideal  conditions  of  control'  (47)\ 
Simulations  can  ^  be  \  conducted  wf.th  or  -without 
•simplifying  approx  In^atl  ons,  as  a-method  of  Validating 
the  accuracy  of  such  approximations  (13).  Experiments 
can^  ahso  be  conducted/  using  the  computer  to  generate 
the  data/  where  It  Is  notf||||ossible  to^actually  perform 
an  experiment  due  t;>-^,the  high  cost/  la^k  of  equipment/ 
or  shortage  of  time  /( l^Ov  1^*/ 46)  .  \  Data  can  also  be 
obtained  from  "theoretical"  experiments/  such  as 
obtaining  plc't4jres  of  mol  ecul  ar  ^orb*l  taTs  (41)/  as  well 
as  experiments*  ^  performed  ^  under  *-"lmposslb1e" 
experimental  conditions  (440.  Th'e  application  of 
graphics  provides  data*  In  -  a  fo rn]  -  similar  to-  that 
obtained  "^frpm  actual  experlm'entS/  as  well  as  In  a 
compact/and  r^adablp  format. 

.A  common  problefn  In  the  »  generation  .  of  ^  such 
slmulatlon^^.^  Is  that^  the  calculations  necessary  for  the 
slm'ula  t  Ion  cWn  not  Be  performed  In  "real --t  Tme".  One 
solution  »  tb  this  Is  to  use  an  Analog  Computer  to 
g.eneraxe  the  data  (33).  ^n  cases  where  the  problem  can 
be  expressed  as  differential  equat-ljonS/  such  as  In 
kinetics  (18)  or  quantum  theory/  solutions  ar6  easily 
p^rogrammed  and  dempnst^a.ted.  Due  to  t;he  parallel  and 
repetitive  ^  architecture/  a  normal  laboratory 
ofec  1 1 1 oscop? 'can  function  as  the  display  devlcV.  "  fiapld 
experiments  performexi  in  this  fashion  are  very  useful 
as  teaching  tools  (8/49)/  where  many  parameters  may  be 
a.djusted  and  the  system  observed^*  under  a  variety  of 
conditions. 

Another  solution  to  the  time  frame  problem  Is  to 
produce  Computer  ■  Genera ^ed  F  Mms>  The  advantage  of 
such  films  In  terms,  of  exactness/  clarity 'an'd  educator 
control     has    been    expressed    elsewhere  (21/26/45/57). 


These  films  do  not  have  to  be  of    professional  quality 
(although  many  a.re  as  good  or  better),  they  may  even  be 
ma-de  by  stgdents  In  a  1  abora  tory,.  (30) ,  they  must  simply 
make  •    the    desired    concepts    cT;^ar    and  Interesting. 
Examples        f  1 1  ms  .of  a  chemMcal  rf^ttrrj5  that    have  ^been' 
made  by*sticn  a^  process  are  ones  showljig  the  vibrational 
states   •of    a    molecule  (3'*)^  the  beh'avlor  of  molecular 
-orbltals    during    chemical     reactjons     (50),    and  the 
dynamics    of    m.olecular^  collisions  -(55).    ^  number  of 
Computer  Movie  Languages  have  -beerh ^developed  (51,53)  to 
assist*  In  the  production  of  simlla.r'  movies.      Films  of 
this    nature  can  'also  help  us  -underSotand  the  results  of 
.  phenomenci  thgt  occur  at  very  hlgK^^r  very  low  speed,  or 
that  occur. In  r-eglogs  where  we    caip    not    obsei^ve  them 
(3^38)'.     "      ^        •  •  1        ^  . 

/Knew  era.  fn  education  can  be-^DpenecJ  for  both  the 
student  and  the  Jeacher  when  thfe' computer  Is  brought 
In^o  the  Classroom  as  an  Intgractjlve  D.Igpl^y  device. 
Accura  te  3-d  I  mans  i  onal  '  rep  rej&€!|p  tat  lons^  Incl  ud  I  ng 
stereograph  Ic,  'and  perspective  jjrnages/  are  possible 
thrbtJgh  the  mathematics  of  geojn^t.py^  Jhp  teaofter*  Is  no 
longer  restrained  by  the  llmlO  afj-hls  artistic  talent, 
.but  by*  his  Imagination  In  .thejfcrea  t  Ion  of  effective 
displays.  Whether  the.  sy^fitem  Is.;^.  used  for  classwlde 
display,  or  as- part  gf  a  Comp^ut^f  Assisted  Pn  struct  I  on 
system,  the  Instructor  Is  provided  with  a  n.ew  *d  Imens  I  on 
in  educational,  technology.^  He*'  can  .  provide  more 
accu'ra'te  dlrectix)n,'  qvew  wi  th"  l.arge^classes.  In  the 
a  reas  of  studeri t ;  In  teract  ion;  (3) ,  man  Ipul  a  1 1  ve 
experiments  (221,,  and  the  c]^ss* -feducat  lonal  pace  f^>. 
As  yetttnly  the  surface  has  he^p  .scfatched  In  this 
Important  appl  ica  t  Ion.  ,of  computer  and  graph  ics 
part  ic  I  pat  I  on,  but  the  future  Ts-^^very  encourag  Ing . 

A  final  appl  Icat  ion-  pf  gjraph'ics  Is  In  the  area  of 
on-fine  Acquis  i  tion.  Cont  roT^vaad  Hl^splay  q1  Pa^a 
obtained  from  chemical  experiments  (15)."  .As  examples, 
such  technlqnes  have  or  will/  be  applied  to  x-ray 
di  ff  ract  ion  ( '♦'O) ,  &as  ch  romatography,  .  mass 
spectroscopy,  •  NMR,  ESR,-  and  a  ya  riety  of  other 
instruments.  The  graphic  form  <^f  the  display'  allows 
the  exper  im6n  te r  to  accura tel y  mon I  tor  and  make 
Intelligent  decisions  about  the  experimental  systerrr. 
Keyboards,  switches,  1  lg+rts'v''and  other  one-diment  lonal 
i  nd  ilea  tors  do  not  prov  i  de  suf  f  Ic  i  en  t  ^  da  ta  for 
compl  iqated  experimental  systems.* 

.  The  ^a±>ove.  has  been/only  a  partial  look  at  what  has 
and  will  be  done  with  Computer  Graphics.  This  will  be 
an  a*«;^a  of  iritt^nse  dev^^Topnen t  ov6r  the  next  few, years, 
and  no  chemist  cen  affprdViOt  to  be  aware  of  and  to 
participate  In  th  i  s-  exc  I  t  i  ng  field  of  experimentation. 
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Communi cations ,  Networks  and  Program-  Exchanges  . 

Introductory  Remarks  by  the  Chairman 
/        •  ^ 

♦         ^  David  0,  Harris       •       -  . 

<  » 

During  the-  decade  of  the  1960's,   the  primary  emphasis  in 
computing  from  the  chemists^  point  of  view  was  toward  the  develop- 
ment and  utilization  of  compu^iiig  technology  for  the  solution  of' 
chemical  problems,  *  As  a  .resul.t  of  this  effort,  hardware  and  soft- 
ware systems  have  become  available  which  are  appropriate  to  many 
"areas.of  chemical  research  and  education.     The  co.st  of  the 
development  of  these  systems  both  in  terms  of  time  as  well  as 
dollars  has  b^een"high  and  therefore  it  is  impo.rtant  tl\at  the  ^ 
results  of  this  work  be  made  available  to  as  large  a  community  ns 
possible.     In  many  cases  it  is  only  necessafy  tojjjrovido  a  pro- 
spective user  with  a  card  deck  or  tape  which  he  can  run  on  his 
own  computer  system  WhUe  in  others  it  may  be  necessary  to  provide 
considerable  assistance  at  a  more  basic  level.     There  are/  however 
other  more  fundam6ntal  problems'.     l\m  sure  that  many  if  riot  all 
-of  the  participants  at  this  conference  have  had  the  experienco  of. 
hearing  described  applLcations  programs,  hardware  and  software 
/systems  which  could  be  of  vnlue  in  either  yoijr  research  or  educu- 
tional  activities' ,but  have  found  on  closer  inspection  tha-t.  it  is^ 
neither  possible  «or  economic  to  use  them  at  your  particular 

X  faciltiy: ■  ,  '  •  . 

In  the  early  days  of  computing-  it  might  have  been  reasonable 

I 

for  each  of  us  to  continually  be  engaged  in  "reijvventing  th,© 
computer  wheel"  ;\ow9ver .  if  we  are  to  take  th§/fullest  advantage 
of  our  computer  tool?  we  must^ in  the  1970 's  address  ourselves  to 
the  problem  of  resource  sKaring  in  its  broadesr^contoxt .     It  is 
to  this  problem  that  this  session  is  devoted.  •Jy*^ 


QUAIfTUM  CHEMISTRY  PROGRAM  EXCHANGE 


and 

PROSPECT  FOR  A  NATIONAL'  QUANTUM  CHEMISTRY 
LABORATORY 

Harrison  Shull 

QCPB  was  born  at  th«  first  Gordon  Gonferance  In  Thooretical 
Chemistry  held  at  TUton,  New  Haapsl^xe,  Ju;Ly  9-13,  '1962,  at 
which  tiae  Professor  Shull  offered  th|e  facllitlf s  of  Indiana 
University  in  encouraging  the  exchange  of  computer  programs ♦  The 
exchange  was  begun  by  Issuing  a  mineographed  statement  of  pur<» 
pose?  and  details,  and  a  q[uestiohn&ire  eliciting  interest.  The 
first  newsletter  with  its  characteristic     monogram,  designed  by 
Dr,  Ke4[th  Howell  as  an  adaption  of  the  monogram  for  Indiana 
.Uhivjerpity  uiillelng  small  H"s^or  an  appropriate  Quantum  Chemistry 
Symbol,  appeared  with  a  datjuline  of  A^oril,  I963*    Thli!^  original 
issue  contained  ^11.  propays,  all  from  Indiana  University  and  a 
newsletter  dUL^tribution  list  of  72  names  supplemented  by  four 
others  in  the  newsletter  itself. 

As  the  exchange  grew  in  sice,  much  of  the  processing  had  to  .  * 
be  mechanised,    Initj^  programs  for  accomplishing,  this  we^  ^ 
written  by  Dr,  Franklin  Prosser,  and  these  have  subsequently  been 
modified  and  added. to  Mr.  ^Richard  Counts.    The  Air  Foroe  Office  . 
of -Scientif io  Research  provided  support  for  the  exchange  beginning 
in  the  fall  of  1964,  and  newsletter  8  of  January,  1965,  was^  the 
first  to  bear  the  cream  and  crimson  covers  advertising  this'  generous 
support.    In  July,  1964,  we  began  publication  of  summaries^ of  curreiit 
research  in  progress.  ^  October,  1964,  ended  Dr^  Howell's  associmtibn 
with  qpPE  when  he  returned  to  Great  Britain.    At  that  time.  Dr. 
Prosser  assiaed  direct  supervision  of  the  exchauige.  i  • 

In  October,  I966,  distribution  of  aodit  prograiAs  yias  shifted 
from  punched  cards  to  magnetic  tapes.    In  Ja^nmry,.  1^6?,  Mr. 
Richard  Counts  assumed  administration^  of  QGPE,  he  still 

undertakes.    In  January,  I968,  we  introduced  hahdling  charges  to 
compensate  partially  for  program  distributibn  costs ^    The  news- 
letter in  February,  1970,  brought  some  inliding  that  the  funding 
pattern  might  be  expected  to  change,  but  it  wasn't 'until  April 
that  we  received  word  that  support  from  the  Air  Force  vould  cease 
^s  of  June  30,  1970.    By  stretching  fimds  and'  by  orderly  closing*- 
out  of  accounts,  we  financed  the  August  distribution  %n  the  final 
Air  iForce  report. 

A  final  report  on  previous  finai^cea  is  in  order.  Through 
June  30,  1970,  a  grand  total  of  $108,651  was  received  from  the  AP 


OSR*ln  support  of  QCPB'  t6T  a  six-year  ptriod.    In  addition, 
Indiana  -UniYtraity  contrilmtad  approxiaately  $16,600  in  computing 
tiflke  mnd  in  ral^asad  tiae  of  p3ro  j6ct  supervisors,  not  counting  any 
,of  Professor  Shull^jtiae.    As  direct  receipts  from  the  membership^ 
through  June,  197P,  largeiy  in  payment  pf  postage  co^ts  for  pro- 
grams mailed  out,  we  repeived  $2,200*    It  ds  a  pleasure  to  acknow-- 
ledge  ^he  enthusiastic  support  of  Amos  Horney  of  the  Air  Fproe  , 
Office  of  Scientific  Research  and  ^of  his  colleagues  during  this     '  /  , 
period*    Ve  know  they -regret  as  much  as  xe  do  the  tezmination  of 
itheir  support*  '    ' , 

-Negotijitions^were'al^  onc^  entered  into  with  the  NatitoiCl' 
Science  Foundation  dffice  pf  Computing  Activities  to  psroyiq^ 
.  iiiterim  continuing  supp6rt  oS  QCPE,    Beginning  in  October,  1970, 
partial  support  for  a  three-year  perlodj^waa  received  covering 
tvXl  anticipated  expenses,  for  1970-1971#  two-thirds  of  expenses 
anticipated  for  1971-72,  and  one-third  for  1972-73*    In  the 
meantime,  it  is  hoped  that  membership  fees  and  program  distribu- 
tion fees  will  build  up' to  permit  continuation  of  the  exohaiige« 

■  Aft  of  June  3i|^71f  o\xr  active  membership  role  was  826,  down 
from  about  1,500  frSfe  memberships  the  previous  August*    For  this 
first  |Mar  of  operation  we  budtjjefted  fee  receipts  at  about  $15,000, 
and  through  June  we  had  received  almost  $11,000,  only  slightly 
below  our  expectations.    Except  for  the  period  from  July  1  tc 
Octjjber  15  when  we  had  zero  "support,  we  would  be  aKproximately 
on  schedule  in  developing  a  self-supporting  activity  by  the  end 
of  the  third  year.    The  crucial  question  vill  be,  however,  to 
what  extent  memberships  are  renewed  .and  active  program  contribu- , 
tions  and  use  'Continue.    It  is  our  hope  to^inlain  QGPE  as.  an 
active  live  organissation  for  transmission  to  a  National^  Quantum 
Chemistry  Laboratory  if  the  latter  can  be  established. 

The  report  of  the  Bethesda  Conference  leading  to'  a  recommendation 
of  that  group  for  formation  of  puch  a  laboratory  hah  been  distributed 
to  the  participants.    The  report  was  reviewed  Iqr  the  Committee  on 
Scii^ncb       Public  Polipy  of  the  J^ational  Academy  of  Sciences, 
''which  quite  properly  declined  to  lend  its  backing  at  this  stage  to 
the  positive  r?coamondatipn  for  such  a  laboratory.    COSPU^  did, 
*  however,  support' a  study  io  depth  by  the  chemistry  CQflaunitr  of 
•t'he  need  for  and  feasibility  of  such  an  'organication,  and  the 
Govem^Lng  Boatid  of -the.  National  Research  Council  has  agreed  to 
sponsor* such  a  study  if .f\mds  can  be  found.    Negotiations  eore 
currently  in  progress  with  the  National  Science  Foundation  to 
mount  su6h  a "study.. 

We  wan^  to  make*' it  very  clear  that  the  proposed  laboratory  is 
not  designed,  prirfarlly  aj3.,a  research  tool*  for  Quantum  Chemists,  but 
rather  as  a  pomp\itational  facility  capable  of  the  production  of 


theoretical  computations  of  use  'to  the  general  cbitKisiry  coaaunity 
for  the  solution  of 'cheaical  problems.    The  committee  to  sttdy 
the'^boratory  if  funding  is  obtained,  will  therefore  be  broadly 
abased,  representative  of  the  total  spread  .of  pbemi^try,  a^  in  ^ 
addition  will  include  experts  knowledgeable  in  computer  and  ^ 
.communication  technology, 

/  '  We  append  hflire  a  table  indicating  the  groKth  in  membership' 
and  use  of  ^dPB  during  these  3rear8y'  Ve  wish  to 'point  .out' that  tbo 
membership  figures  represent  active  members  only'^ .  The'  list  has 
been  culled  each  year  to  eliminate  the  names  of  those  who  ho  longer 
maiirtain  aA  active  interest  in  QGPE,  ' 
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A  PORVARD  LOOK  '  i 

Lawrence 'G.  Roberts  ^ 
'    '  •    -Advancf^  Research  Projects  AgQticy' 

ComT).uter-to-co)nput0;r  data  communl».cations  '^o'day  might  be  com- 
pared with  people-to-people  communication  v£a  telegraph  before  the 
day  of  the  telephone.    Sending  ^  message  by  telegraph  was  so  slow 
th^  the  media  could  only  be  used  for  non-interactive  transmission 
o-f  essential  information,    As  such,  its  use  was  limited,    The  tele- 
phone provided  an  ability  for  people  to  interact,  thus  permitting  a 
whtoie  new  range  of  applications^^.   Considering  people'  somewhat 
mechanistically,  we  might  view,  their  use  of  the  telephone  as  inter- 
human  .resources  sharing,    To  solve  a  problem  a  man  will  call  t^ose 
people  who  have  .bits  of  data  which  he  heeds  and  wiy  call  on  specialists 
for  opinions,  thus  making  use  of  other  human  resources.    This  is' 
acrfiieved  because  the  pedik  is  appropriately  responsive  for  human 
requirements  and  permits-  interactive  conversation,  thus  eliminating, 
the  need  for  trarremitting  excessive  detail,  much  of  which  may  be 
unnecessary.    Also,  with  an  interactive  dialojg,  information  does 
not  need"  to"  be  i'obnatted  i^i  a  standard  way  since  details  can 
always  be  clarified  if  misunderstood.    This  increase  in  utility  * 
and  the  many  new  applioations  .theifeby  permitted  have  resulted,  as  ^ 
we  know i^tttsA^ vast  increase  in. telephone  traffic  volume  over 
telegraph  tr^fic  levels,  '     '  -  ^ 

Communication  between  computers  would  mogt' likely  be  effected 
in  an  analogeous  -manner  if  a  data  communication  system  were  ma/ie 
available  which  matched  the  ndeds  of  computers  as  we3,l  as  the  phonA 
matches  the  needs  of  humans.    Such  a  system  would,  of  course,  have 
to  have  different  technical ^parametftrs  such  as  comioctlon  time, 
.data  rates  an^^  reliability,  than  those  required  for  voice  communi- 
cation, but  if  it  permitted  truly  interactive ^conversations 
between  a  larfe  ensemble  of  compoters,  the  effect  should  be  much 
the  same  in  that  it  would  permit  remote  access  to  specialized  ^ 
•hardware  and  software  resources,  .ioint  problem  solving  and  the 
dynamic  retrieval  of  data  from  remote  files^    The  analogy  with. the  ^ 
telephone  is  just^  one  way  of  examiniijg  the  potential  impact  of 
substantially  improved  data  communication  between  computers  and  the 
resultant  increase  in  applications  and  traffic  which* such  a  chanf^e 
mi^ht  bring' about,  *  .  '  • 

Intercomputer  communication  has.  many  quite  substantl^il 
differences  from  interpersonal  voice  communication,    Whfereas  voice 
convetsation  is  a  rather  continuous,  constant  data  ratft  process, 
communication  with  computers,  either  from  computer  consoles  or  ' 
other  computers,  requires  a  burst  t'^nsmission  rate  S'^veral  orders 
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of  magnitude  higher. than  the  average  rat$^  even  durlnr  a^sinfl:]e.  . 
conversation,    Since  therfe  has  been  very \lit tie  experience  so  far 
with  real  intercomputer  traffic  where  two  pfbgrsinis/  are  talkinr  to 
eagh  other,  it  is  useful  to  examln*  the  charaoteristics  of  computer 
console  traffic  which]  is  botl)  a  component  of,  'aLnd  is  also  likely 
to  have  the. same  general  parametierfe  as,  ^*omput^r-to.-computei?  traffic. 
Prom  statisti^cs  on  teletypes,  graphic  consoles  and  re^pole  batch 
stations^  it  appeairs- that  the  ratio  of  burst  rate  to  average  rate  is 
approximately  100  to  1>    This  means  that^lf  a  star^dard '  cbtnmuni- 
cations  line  is  es.tablished^for  ft  computer  ccJnversation,  tVie, 
average  Utilization  of  "that  line  wjill  only  be  abouf  \%  and  there-  * 
fore  the  cost  will  be  10-100  times  higher  than  the  raw  cost  of 
moving  the  bits,    A  second  characteristic  of  computef-to-conputpr 
communications  is  thayt  the  co'^rect  time  t'^.  es+ablish  a  conversation 
must  be  short  era&ugh  that  the  computelrs  qr'the  comput^  usei?S  are 
not  held  up  unduly  >^en  the  ,need  ^  tcr  access'  a  special  resource  is 
determined,    For  computers  the  "connect  time'"'  should  "l^,, consider- 
ably le^s  than  a  second  as  opposed  to  the-  20  or  30  seconds  .commonly 
experiences  for  voice  communications,    Thirds  the  maximum  ciata 
rate  required  in  man-machine  interaction  must.be  considered. 
It  is  known  that  for  useful  comprehension  by  a  huma^;!,  the  peak  ■ 
data  irate  for  graphical ,  material  is  on,  the  order  of -20  Icilobits  per. 
second,  which  suggests  the  requi-^ed  bandwidth' for  console- to- . 
computer  communicatidfis.    This  also  suggeV>>*^  at  least  a  minimum 
for  computer- to-computpr  communications,    Finally,  the  error    \  f 
, rate  for  intercomputer  traffic  must  be  far  lower, than  required 
for  voice  communications  or  computer  console  tfciffic  since  the^**^ 
Is  usually  very  little,  if  any,*  redundancy  inherent  in  the*  data. 
Por^many  prplications  the  error  rate  must  be  les§  than  one  in 
10     bils.   ^At  the  same  time,  the  rpiiability  (up  time)  of  the 
data  communications  syste^m  must  be  very  high  if  the  user  is  to 
depend  on  remote  resourfces.    The  cost  of  a  data  sf^rvice  providing; 
•the  charactpristics  putlined  above  must  be  compared  with  the 
-t^oRt  of  duplicating  the  computer  resources  involved,    V^ry  . 
simply,  if  th^  monthly  cost  of  ad^qik+p  communications  J^^tviop 
^xceeHc;,  or  even  ?^pproaohes  +he  cost  of  a  reasonably  well  endowp^i 
or^m^iit.or  i nstallj^tion,  it  is  not  likely  to  be  pconomical  to  us^ 
+  h^t  ^^mmunica'^^ons  ^service  rather  than  duplica+e.  thaj^  f^cilit-'r 
^"^bitr^^rllv  c^^ttinp*  ^  tbrpsbnld^at         of  a  computer  fr^rilitv 
OOS+ ,  i+  c^"^  be  p'^pdictM  t^?t  tHe  communications  system  '?h<^u**d 
net  ^0^"^  more  than  $10K  per  month  perfp'^p, 

A  few  vpars  ar^o^  no  comm)inicatlors  ^syp^+er  in  pxistence  pv<=>n 
cam^  cinsp  to  providing  thp  type  of  service  just  r^escribed. 
Therefore,  APPA  undertook  to  develop  such  a  carebility  so  as  +o 
make  'resource  sharing  between  computers  possible,    Tfie  communica- 
tions system  which  resulted  is  nti;^ized  in  the  ARPANET  and 
cuTT^en-Kly  intercopaects  more  than       computers  at  ^5  locati'^j^^i 


around  the  countty.  By  early  l972,  expajisiorv\to  25  locations 
is  expected,  (Figure  l);  A  delay-enginepred  message  snitching 
sysrtem,  the  ARPANET' consists  of  Interface  Message  Processors 

,  CD©s)  at  each  node  intercommunicating  over  50  kilobit  leased 
communication  lines  and  connected  to  one  or  more  Host  computers  , 

^at  6ach  site,    The  IMP  aCccepts  messages  from  ,thp  Host,  breaks 

'them  into  thousand  bit  packeiTs  and  sends  each  packet  toward 
the  destination  over  whichever  communicatidn^ine  is  currently 
optimal.    Each  IMP  in  turn  checks  the  error  detection  code 
on  the  packet  and,  if  it  checks,  routes  the  packet  on  to  the 
next^node  and  sends  an  acknowledgment  to  the  previous  node. 
At  the  destination,  padkets  are  assembled  back  into  a  message  ^ 
and  delivered  to  the  Host.    In  practice  this  organissation  proves 
to  be  extremely  responsive,  delivering  short  messages  anywhere 
in  the  country  within  .1  second  and  permitting  throughout  rates ^  , 
for  long  messages  of  up  to  80  kilobits  per  second.    By  adjusting 
the  number  of  communication  lines  which  are  leased,  the  networic 
can  be  engineered  to  have  -almost  any  desired  overall  average 
capacity /between  2  kilobits  per  node  and  60  kilobit^  per  node. 
Since  each  communication  line  is  being  used  for  traffic  between 
many  pairs  of  nodes  simultaneously,  it*  can  be  loaded  quite' 

'  efficiently  even  though  the  individual  Host-to-Host  conversa- 
tions have  such  a  high^ratio  of  bursji  rate  to  average  rate.  The 
actual  cost  of  the  total  network  communications  system  including 
the  cost  of ^IMPs,  maintenance  and^communication'  lines  ranges 
from  $3K  to''$6K  per  month -pef*  node,  depending  oh  the  overall 
traffic  levels  and  the  facilities  requi^red  at  each  node.  For 
-new  people  entering  the  i*ietwork,  the  February,  1972,  network 
of^.23  nodes  is  currently  estimated  to  cost  $4,8K/node/month;  $3. IK 
for  an^equal  share  of  the  communication  lines  cost  and  $1.7K  for  the 
lease  pf  a  minimal  IMP.    If  a  user  wishes  i;o  provide  direct  console 
access  ta  the  network,  .a  Terminal  Interface  Processor  (TIP*)  yfonld 
be  used.  N^e  TIP.  which  will  become  available  in  August,  1971 > 
will  act  both  as  In  IMP  and  as  a  simple  host,  permitting  up  to 
6k  consoles  and  peripheral  devices  to  intercommunicate yi*ith  any 
host  in  the  network  at  rates  up  to  2Q  kilobits/sec.    Thus  tM 
TIP  expsuids  the  network  concept  to  incluae  nodes  without  an 
interactive  host  of  their  own,  but  who  wis)i  high  bandwidth  support 
for  graphic  consoles,  printers,  and  large  collections  of  lower  ^ 
speed  devices.    Use  of  a  TIP  increases  the  cost  by  $l,^K/mo. 

Although  an  equal  share'  of  the  communications  line  cost  is 
currently  allocated  to  each  node,  this  policy  will  be  changed,  as 
soon  as. feasible,  to  one  of  charging  only  for  the  bits  actual lly 
sent  from  each  node.  Referring  to  Figure  2  it  can  be.  seen  that 
the  cost  of  the  network  increases  almoAt  linearly  with  capacit:^, 
at  least  for  bandwidths  below  l6  KB/node.    Also,^*3rt  turns  quit 
that  the  capacity  and  cost  of  thpse  distributed  networks  are 


■  401.  . 


remarkably  insensitive  'to  the  dist^ribution  and  destination -of 
traffic,  the  total  traffic  being  the  oply  important  parameter. 
Thus,  it  is  appropriate  to  charge  for  traffic  initiated  at  a 
node,  based  6n  the  cost  of  increasing  the  total  capacity  of  the' 
netifork  by  that  amount.    From  Figure  2  it  can  be  seen  that  this, 
will  be*ll0/megabit  for  the  ARPANEI^,    However,  since  the  netwoi* 
cannot  be  expected  to  be  always  fuy.y  loaded  to  pea3c  capacity, 
.  day  and  -night,  it,  is  likely  the  actu^al  rate  will  be  300/meeabit 
based  on  an  estimated  j6%  average  loading.    The  total  coat  per 
node  would  then  be  $l,7K/month  plus  30?i/megabit,  -  . 

.Looking  ahead,  assuming  the  broad  availability  of  a  data' 
communications  service  similar  to  the  ARPANET  system,  it  is  clear 
that  v.ery  significant  changes  in  computer  system  organization 
will  take  place.    Some  of  these^ changes  will  occur  rapidly— -within 
the  first  five  jears~and  others  will,  take  a  decade  or  more  before 
people  fully  accept  the  concepts,  *Soon  after ^a. network  with  a 
dozen  or  more  reliable  computer  service$(^^becomes  availiHe^  many  -  \ 
institutions  will  ^  find  it  far  more  econoii\ic,  to  obtain  t^lr 
computing  services  from  a  sielected  set  of  these  remote  systems, 
rather  than  run  their  own  computer  center,   .For  example,  take  the 
casn  of  an  institution  about  -ready  to  upgrades  its  facility,  On^ 
chosen  would  be  t^'o'btain  a  medium  scale,  general  purpose  batch 
System,  admitted] y  9,  compromise  for  their  lar/^e  numerical  lasers 
and  tlne-shar? rg  users,  but  +he  best  single  pij^tem  that  the;'  ^.^r^ 
^ffo^.    Altemattvply,  they  opuld  bby  no  new  *achinp  and  oh+.^i;i. 
access  to  several  of- the  systems  on  the  Network  through  a  Term'^^.'*! 
Jnterfare  Processor,    This  approach 'pprmits  tTtelr  Targe  numerical 
*us9rp  to  US'*  a  Istrge  "number-cruncher";  their  Statistical  and 
payroll  users  to  access  a,la!rge  .scale  general  purpose -system,,  and 
th«-^r  Interactive  users  to  have  teletype  or  c^.phic  console 
access  to  a  good  time-sharing  system,    OveralL  the  cost  of  each  . 
se-nrice  is  less  than  it  wguld  have  beem  on  a  dedicated  G-P  computer, 
by  factors  between  two  and  t^n.    Also,  they  can  buy  .just  the 
capacitjr  they  need  and*  expand  smoothly  rather  than  having  to 'pay 
fo?^an  oversize  machine  for  a  year  or  two.    The  peripherals  cost 
the  same  in  either  case  and  the  network  cosi^  is  ^negligible  compared 
to  the  direct  computer  cost  savings.    As  added  benefits,  the 
computer  services  they  use  are  probably  better  run  and  more 
-►"p  11  able' than  they  could  hope  to  do  themselves  since  the  s^-nrice?; 
mu<?+  stay  competitive;  a  wider  range  of  software  is  available  and 
can  be  accessed  directly  wlthotit  translation*  or  transfer;  and  as 
nek  harrJware  is  introduced  which  Is  economically  useful,  thev  can 
transfer  lobs  -fo.  it 'on  a  selected  and  liesurely  basis. 

The  direct  use  of  distributed  hardw^^re  services  just 
d'^scribed  will  probably , account  for  most  of  the  Initial  use  of 
the^network,    This  {p:*owth  should  proce^^ '^bout  unlfoT^nly  over 


the  nexii  eight  years— two  computer  .replacement  cycles'.  Some 
additional  traffic  will, be  introduced  by  the  gradual  transfer 
of  cuprent  ^ta  traffic'.from  other  data*  communication  networks 
to  the  computer  network  due  to  the  economy  or  reliability,  but  the 
total  quantity  of, this  traffic  is  minor  in  comparison  to  the  new  • 
traffic  generated  by  the  computer  resource  sharing  ^ictivity. 

A  seppnd  major  application  of  the  computer  network* is  data 
base  sharing— <Hrect  retrieval  from  re'mote,  one  of  a  kind  data 
baset^.    Currently,  when  large  data  bases  or  files  are  needed  at 
several  computer  centers,  duplicates  are  maintained  at  each  center. 
This  difficult  anji  costly  practice  can  be  avoided  if,  the  access 
speed,  through  the  network  is  fast  anough  so  that  neither  human 
users  nor  computer  processes  ar^  unduly  delayed.    The  ARPANET 
response  speed  of  one-tenth  of  a  second  for  a  question  and  three- 
tenths  of  a  second  for.a  one-page  answer  *is  quite  acceptable  for 
a  human  user  and  for  a  computer  program  it  is  no  worse  than  a 
slow  disc.    To  start  with,  this  response-  appekrs  adequate;  however, 
further  experience  may  i-ndicate  a  need* for  faster  response  in 
future  networks.    Data  base  sharing  vill  hot  build  up  as  rapidly 
as  hardwaice  service  sharing, however,  since  it  represents  only  an 
incremental  saving  for  an  installation  and  demands  considerable 
faith  in  the  network*    Copying  a  10    bit  data  ba^e  monthly  migl^t 
cost  $2,000 — less  than*  the  minimum  ne'twork  cost  and^  therefor^  not 
a  prime  motivation  for  Joining  the  networfc.^  However,  the  cost  of 
accessing  the  data  base  through  t^e  network  would  cost  at  mdst 
$3Q0  even,  if  all  the  data  were  required,  providing  a  considerable* 
cost  saving  and  convenience  as  long  as  the  network,conrt#ctlon  had 
other  j^Y^tification.    Of  course  for  very  large  data  bases  such  as 
the  10.    bit  weather-cfimate  data  base  being  developed' by  the  kit- 
Weather  Service  for  the  ARPANET,  the  cost  of  eithet  copying  or 
storing  a  duplicate  would  immediately  justify  network  connection. 

In  most  cases  very  large  data  bases  would  not  be  developed 
at  all  without  a  network  maiking  possible  nation-wide  accesg,  since 
the  cost  would  be  prohibitive.    Data  base  sharing,  therefore,  is  • 
not  likely  to  grow  rapidly  until  the  network  i?  reasonably  well 
established,,  l^igging  the  service-sharing  growth  by  perhaps  two 
years  J  but  then  growing  exponentially  as  everycm^^requests  access 
to  aji  the  information  available. 

Software  sharing,  the  third  major  application,  is  the  remote 
use  of' software  subroutines  and  packages^' programs  not  aV'ailable 
on  the  users*  primary  computer  due  to  imcompatilibity  of  hardware 
or  languages.    An  example  of  this  type  of  activity  might  be  the 
use  by  HIT  scientist  of  the  Stanford .Heiiristic  Dendral  System,  a 
program  for  determing^'mqlecular  structure  given  the  mass  spectrum. 
On  a  computer  at  HIT^the  scientists  would  collect  and  preiy^ocess 
the  mass  spectrum,  data.    Then,  much  like  using  a  subroutine #  the 
Stanford  computer  would  be  called,  the  data  sent  and  the  molecular 


•  structure /when  determined,  sent  back.    If  interaction  vere  required, 
the 'MIT  scientist  would  be  interrbgated  much  as  if  he  wereyat  slan-" 
ford,  thus  building  up  the  heuristic  model  biased  on  nation-wide 
inputs.    The  MIT  computer,  upon  receiving  the  response,  would  proceed 
aocally  with  the  calculations  or  displays  desired*    Software  sharing 
like  this  will  be  r9quired  if  we  are  to'' maintain  maximum  progress  as 
the  volume  of  useful  software  continues  to  'expand.    Since  thQ  aiinual 
cost  for  software  is  already  larger  than  that  for  hardware  and,  to  " 
some  extent, 'should  be  cumulative  tather  thsui  wearing  out,  the  long 
range  importance  of  software  sharing  is  clearly  greater/   However,  due 
to  human  inertia  and  a  strgng  "not  invented  helre"  syndrone  associated 

"with  software,  it  is  clear  that  the  cros^-utilization  of  software 
will  take  years  to  develop.    The  buildup  of  software  sharing  activity 
will  most  likely  begin,  very  slowly^  growing^  exponeotially,  but  not 
become  ajnajor  fa^tor'imtil  the  network  becomes  well  establisVied  in 
four  to  eight  yeatrs. 

Besides  hardware,  software,  and  ^ta  base  sharing,  there  arf^ 
many  oth^r  i*'portant  network  aDBlicatldns,  alT  /of  which  requirn  & 
inrj^e  viable  network  before  they  become  irtPportarit  In  their  own  right,' 
,  These  include  teleconferencing,  publishing,  library  ser^rices  and 
dffice  paperyork  filing  and  distribution,.  Ten  to  twenty  years  from 
now.  tl^ese  applications  may  well  dominate  comput'er  usage  and  network 
usage  but  they  are  not  likely  to  be  Mportant  factors  for  at 'least 
five  years,  ^     ^  ^  , 

Overall,  then,  hardware  service  shSxing  is  likely  to  be  the 
major  factor  causing  networks  to  come  into  existence  slince  the 
.  effective  cost  of  computing        be  drastically  lonered.for  only  a 
moderate  communications  cost ,^  Then,  data  base  shating  will  become 
the  don^inapt  force  expanding  the  traffic  in  Vhree  to  four  years, 
Software  sharing,  although  very  important  in  the  long  run,  wii;. 
not  become  a  major  factor  for  four  to  eight  years.    The  text 
Oriented  services,  libraries  and-^ffice  work,  will  then  come  into 
their/owa  In  ten  to  twenty  years.    The  whole  trend  should  decrease 
the  importance  of  the  general  purpose  computer  as  stand-alone  sys- 
tems, ajid  substantially  increase  the  importance  of  specialized 
systems—ones  which  can  provide  a  specific  service  at  the  lowest, 
oost;  *  ■  '  . 
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Introduction  -  ^  -  ,        t  ) 
 -P                          ^                                                    ,  ^  -  ^ 

1971  is  the  25th  anniversary,  of  the  first  ^large-scale  electronic  cqpputerj 

ENIAC,  designed  anji  built  by' Prof essors  Eckerjt  a^d  Maucl^ly  at  the  tlniyerBtty 

of  Pennsylvania. 


Tod>ay  some  100,000  computers,  are  in  operation  aroUnd  (he  world,  70^000  of 
them  in  the  USA.  ^  ^  '       ^     !  ** 

A  widespread  unit  record  punched  card  information  storage  and  hapdltng 
technology  constituted  a  fertile  environment  for  the. computer  which  led 
to  its  Jciipid  proliferation,  operating  in  stanpl-alone  mode,^  soon  after  its 
invention.    Mpre  re^cently  the  cbmputer  has  cotite  to  be  ititerfac6d  with,  and^ 
used  as  an^ important  component  o;E,  our  communication  network.    A  second 
rapid  growtK  period  in  computer  use  is  developing,  with  a  corresponding. rapid 
incl^eage  in  numbfer  and  variety  of  terminal  devices  providing  more  c.onvenient 
and  flexible  access  to  cqpputer  titlfHties.'" 

The  computer,  more  accurately  named  the  informatUon  pro.cefiping  machine,  has 
impacted  every  area  of  human  endeavor.  Yet  its  potential  in,  the  service  of 
mankind  remains  largely  untapped^'  its  principal,  applications  continue  to  be 
only  the  two  conceptually  primitive  ^nes:  as  a  giant  desk  calculator  serv- 
ing engineers  and  scientists,  and  as  a  giant  accounting  machine  serving 
administration. 

i  '  ; 

Computers  in  Higher  Education 

Computers  came  to  a  few  college,  and  to  many  university  ctopuses  in  support 
of  administration  and  in  support  of  graduate  rersearch,  primarily  as  a  giant 
desk  calculator.  ' 

At  those  campuses  with  extensive  research  activity,  the  computation  centers 
gfew  very  rapidly,  depending  very  heavily  on  federal  support,  and  having 
Bs  their  primary  objective  support  of  graduate  research.  . 

Relatively  little  use  was  made  of  ±h^  computfec  in  undergraduate  programs, 
let  alone  in  high  school  curricul^  even  though  .it  was  becoming  evident 
that  the  problem  solvers  and  the  decision  makers 'of  our  society  were 
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.        /  \      •  ,  •  ^  • 

discovering  new  and  more  powerful  techniques  for  doing  their  work  which 
depended  in  an  Essential  way<,on  the  computer.    Clearly  Education  was  ^ 
failing  to  meet  its  responsibility  of  preparing  the  student  of  today, to 
be  th6  citizen  of  tomorrow, 

N$F  Regioaal  Networks       ^    i  * 

,  in  Septen^ber  of  1967^  the  newly  formed  Office  of  Computing  Activities  of 
the  l!?.^tional  Science*  Fpundation  convened  30  Coiiputer  Center  Directors  from 

,  ai:oUT>d,.^th^  country  in  order  to  discuss  thfe  feasibility  of  diffusing  their 
c-omputet  6xpettise  and  computer  service  to  neighboring  institutions  of 

'  higher  leatning,  particularly  colleges.    Of  those  3D,  nine  submitted  pro- 
posals which  were  funded  in  FY  1^968  and  thus  were  created  the  first  regional 
networks,  namely:      '*  -      '        ,  ^ 

St.  '^ibelm's  College,  New  Hampshire,  plus  Ave  colleges 
Carnegie-Mellon  University,  Pennsylvania,  plus  ten  colleges 
Burdue  University,  Indiana,  plus  six  college^ 
.  University  of  Iowa,  Iowa,  plus  ten  colleges  , 
Southern  Regional  Education  Board,  Georgia,  plus  ten  colleges 
,  .    Illijioiff  Institute  of  Technology,  Illinois,  plus  *nine  colleges 
tornell  University,  New  York,*  plus  five  colleges 
'Dartmouth  College,  New  Hampshire,  plus  nine  colleges 
•  Oregon  State  University,  Oregon^  plus  six  colleges 

Sinte  that  time  the  numbex  has  grown  to  23,  supporting  212  colleges  (107o  of 
the  nation's  total  universities  and  cplleges),  with  the  emphasis  shifted 
from  computer  programming^ins truction  and  computer  service  to  discipline-^"^ 
oriented  faculty  training  and  curriculum  developmetit.  ^  .  / 

Funding  and  Future  of  University  Computer  Centers 

The  ^e-empha^is  on  funding  computer  service  per  se  has  corresponded  to  a 
fairly  abrupt  and  sharp  reduction  in  federal  support  of  university  computer 
centers  in  general.    That,  together  with  the  general  depression  of  the 
econpmy  over  the  past  two  years,  has  created  a  major  support  problem  for 
university  computer  centers  approaching  a  crisis  situati'on. 

Additionally  university  research  scholars  whose  research  progress  ,has  as 
one  limit  the  size  and  speed  of  the  computer  available  to  them,  are  working 
at  a  level  beLpw  that  of  current  technology  because  no  university  has  on 
its  campus  the^most  powerful  computer  extant • 

A  recent  two-year  study  by  the  RAND  Corporation  suggests  that  the  day  of 
the  single  large-scale  university  computation  center,  attempting  to  support 
the  ^iltire  range  of  computational  needs  of  administration,  research,  and 
education,  is  fast  passing.    A  multifaceted  commercial  computer  utility  '  * 
spectrum  is  evolving,  made  all  the  more  viable  with  the  increase  in  the 
data  communication  sophistication  of  the  communication  indus'try. 
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That  study  also  suggests  that  development  and  wide  distribution  of  cotoputer- 
based  curricular  elements  will  occur  at  such  t'jme  as  the  commercial  sector, 
such  as  the  textbook  publishing  industry,  ent^i*B  the  picture  with  a  royalty 
system  for  the'  authors  and  a,  standardization  ^tid  distribution  system  for 
the  audience.  -'f 

; *  . :  '  '  • 

Neither  of  the  two  developments  foresee9  by,,^He'  RAND  study]  yet  had  manifested 
itself,  however, /and  the  NSF  developed  system'of  regional  networks,  iihich 
has  caused, a  rapid  expansion  of  undergraduate  computer' use,  bath  in  amount 
and  in  discipline  range,  continues. to  be  the  major  force*    Not  the  least  of 
the.  factors  operating  16  the  low  cost  possible.    For  example,  using  IIT/s 
Remote  Job  Entry  fast  batcK  system  with  turnaround  times  of  less'  thari 
i  minutes,  contact  time  charges  for  TTY*s  of  ^2.00  per  hour,^  and  an, efficient 
compiler  with  minimum  charge^  of  25$  per  run,  has  pTermitted  as  mfiny  «s  12  - 
small  student  programs  to  be  batched  together'  as  one 'run  befdr\  the  pinimufi^- 
charge  for  UNIVaC  1108  CPU. and  channelj^timg  of  25(?  was  exceeded. 

Cixxt iculum  Deve lopment  ^  '     *      *  '  ' 

The  three-day  NSF-suppor ted.  Iowa. conference  q£  June  1970,  tflslgned  t'o  deiil 
with  broad  brush  use. of  the  computer  in  undetgraduate  curticula,  drew  740 
paretcipants  and  hundreds  of  pap'ers.    Its  successor,  the  Dartmouth  conference 
of  June  1971,  drew.JLZOO  participants  and  hundreds' of  papers.  ■    -  ^, 

Most  of  the  creative^, effort  tias  be^sij,  limtt^A' to  producing  computer  program^  . 
for  handling  apeciflc-  clas^^es  qf  j)i:oblems  where  the  computer  is  used  as  a 
desk  calculator.    However  simulation^  particula^rly  in  economics-^  and  "manage- 
ment, and  non-numerical  applications,  are  beginning  to  appear.   *Also  use  of 
the  small  deciicated  computer  in  a  control  environment  is  represented  in  at^ 
least  a  dozen  colleges  and  universities  in  the  undergraduate  program:. 
These  specific  applications  have  no^<'yet  caused  any  sweeping  revision  of 
courses,' let  alone  of  curricula.    That  is  yet  to  come* 

As  a  distribution'mediumi  the  standard  journals  serving  the  several 
disciplines  arfe  sttll  groping  trying  to  discover  whether  algorithms  are 
publishable  and,  if  so,  in  what^form.  * 

Conferences  with  proceedings  'con^stitute  yet  another  distribution  medium. 

Computer  program  exchanges  are  a  viable  means  of  distributing  computer 
programs  and  corresponding  'descriptions.    These  have  sprung  into  being  as 
part  of  several  of  the  regional  computing  activi^ties  and,  at  Eastern 
Michigan  University,,  as  a  regional,  discipline-oriented  entity  "which  exists 
as  a  collectio!;i  and  distribution 'center,  and  as  an  .education  center.  There 
has  not  yet  appeared  a  comprehensive,  nation-wide,  standardized  program 
exchange  such  as  the  research-oriented  QCPE.*i  * 

*  Quantum  Chemistry  Rrogram  Exchange,  Indiana  University 


State-Wide  Networks 


The  trend  seems  to       toward  regional  networks  organized  on  a  state-wide 
tasis.    This  has^already  happened  in  North  Carolina  ^nd  in  Georgia 'on  a 
large  scale,  and  many  other  states  are,  well  on  their  way  to>  similar  .systems ^ 
Such  systems  are  involving  both  the  public  and  private  institutions  although 
they  tend  to  be  dominated  by  the  public  system.    Exceptions  Occur  in  that 
multxstate  cooperatives  are  being  formed  involving  *low  population  statxes 
and  in, that  metropolitan  cooperatives  are  being  formed  in  major  cities. 

The  IIT  Regional  Network--A  Particular  Experience 

One.  Qf  the  original  nine  NSF-supj)orted  regional  "netwoyjks , 'the  IIT  .network, 
was  unique  in  that'  it  had  at  the  onset  faculty  training  and  ,cuf riculum 
'development  as  its  priiQ^a-ry  thtust.  '  That  project  embocfies '  all  tfie  elements 
of  a  regional  networkr^  ^V^n  those  recently  formed,  and  the  Introduction  and 
Summary  from^  the  final  report  for  the  project  is  herein  appended  in  order 
to, impart  some  of  the  flavor  of  regional  networking  and  Co  serve  as  a 
concluding  statement  for  this  paper. 

i  < 

FINAL  REPORT  ON  THE  IIT-CENTERED  REGIONAL  COMPUTER  FACILITY,  FACULTY  TRAINik, 
And  CQOPERATIVE  program  in  curriculum  DEVELOPMEOT:     INTRODUCTION  AND  SUMMARY 

PETER  G.  LYKOS 


History  and  Objectives  .  '  *  ' 

In  July,  1968,  the  Rational  Science  Foundation  made  a  grant  to  Illinois 
Institute  of  Technology  and  nine  participating  colleges  and  universities  in 
northern  I^llinois  and  southern  Wisconsin  for  purposes  of  engaging  in  a  joint 
program , in.  the  development  6f  a  regional  .computer  netwotk  and  in  faculty 
training  and  curriculum  development  involving  the  introduction  of  the 
computer  as  an  instructional  tool  in  undergraduate  courses.    The  project 
was  for  a  combined  total  of  $885,000  for  a.  period  of  26  months  ending 
September  1.970  (subsequently  extended  through  June  1971).     In  addition  to 
Illinois  Institute^  of  Technology,  the  nine  participating  institutions  were: 


Chicago. City  College    (four  of  eight  campuses) 

JDominican  CoJLlege  (Wisconsin)' 

;jE:imhar St  College  '     /  * 

.Loyola  University    (fi^o  campuses) 
^  Monmouth  College 
VMundelein  College 

^  Quincy  College  '  ^      •  '/ 

,Ripon  College  (Wisconsin)  '  '  ,  "  / 

St*  Xavier  College 


410 


■7-17 

In  1969,,  Blackburn  College  and  Whedt9n  College  joined  the  project  on  a 
self-funding  basis*  . 

The  main  group  (9)  of  participating  institutions  consisted  of  liberal  arts 
colleges.    The  exceptions  were  a  large  university  (Loyola),  a  large  city 
college  system  (Chicago  City , College,  with  nine  branches),  and  an  institute 
of  technology  (IIT)  specializing  in  science  and  technology. 

Except  for*  IIT,  none  of  the  participating  institutions  had  stand-alone 
computer  facilities.    In  fact,  all  had  not  previously  used  computer  facilities 
to  any  significant  extent.    In  those  cases  in  \^*ich  there  was  previous 
experience,  it  was  initial  and  preliminary  and  the  same  applied  for  instruc- 
tion in  programming. 

IIT,  as  the  lead6r  institution  in' the  setting  up  of  the  regional  computer 
network,  had  a  modest  mecjium-scale  computer  facility,  'an  IBM  360/40  with 
an  IIT-developed  remote  job  entry  system  and  a  cCThver^ational  language 
supporting  remote  teletypewriters  (later  changed  to  a  UNlVAC  1108  configura- 
tion)     In  addition,  IIT  had  extensive  experience  over  many  years  in  pro- 
gramming language  instruction,  ranging  over  diverse  groups,  from  high  school 
students  through  c(^llege  instructors. 

The  essential  objectives  of  the  project  were: 

1.  Development  of  operating  remote  tenninal  facilities  at  the  other  , 
participating  institutions  accessing  a  common  flexible  computer. 

2.  Stimulating  programming  language  instruction  and  other  means ^of 
developing  d^user  community  of,  computer  facilities  at  the  other 
participating^ institutions,  and 

3.  .Faculty  training  and  faculty  involvement  in  curriculum  development     *  " 

involving  the  use  of  the*  computer  as  aji  instructional  tool  in 
specific  discipline  course  work. 

Since  IIT  had  been  operating  a  highly  successful  computer  network  for  remote 
users  for  some  time,  the  basis  of  the  mechanics  of  a  regional  computer  net- 
work was  already  present  and  there  was  little  wasted  effort  in  extending  the 
existing  network  to  these  othet  institutiofts  (although  the  radical  change 
in  central  computer  system  midway  through  the  project  worked  a  major  hard- 
ship on  the  remote  terminal  users).    There  were  then  no  major  dlfticulties  in 
making  arrangements  for  each  participating  institution  to  have  operational 
remote  terminal  facilities  relatively  quickly.  \ 

The  second  of  the  objectives  listed* above  was  also  relatively  quickly  accom- 
plished.   In  this  respect  the  previ'ous  extensive  IIT  experience  and  the 
limited' experience  of  some  of  the  groups  served  as  examples  of  operating 
^nechanisms  of  Instruction. 
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The  major  thrust  of  the  program  thus  was  the  third  objective  listed  above. 
This  was  one  stage  of  "nondevelopment*'  which  all  participating  institutions 
had  in  conmon.'   In  essence  the  primary  objective  of  the  program  was  the 
unique  area  in  which  there  was  almost  a  complete  lack  of  previous  develop- 
ment, or  even  planning  for  development  thereof.    It  is  most  interesting, 
and  indicative  of  the  very  real  need  for  %hB  support  of  a  program  such  as 
this,  that,  even  at  IIT  with  its  extensive  Computer  facilities  and  its 
extensive  background  in  instruction  of  this  project,  no  formal  or  even 
informal  activity  in  the  area  of  the  'development  of  curriculum  materials 
whicn  make  use  of  the  computer  as  an  instructional  tool  existed. 

Project  Structure    .  ^ 

Each  participating  school  recognized  that  faculty  training  and  faculty 
participation  in  curriculum  development  was  the  dost  important  objective. 
Bach  participating  campus  identified  a  number  of  faculty  in  a  variety  of 
disciplines  for  participation.  . 

The  basic  operational  mechanist  used  was  to  organize  the  participants  into 
discipline  study  groups.    Each] study  group  had  a  group  leader  drawn  from 
one  of  the  universities  In  the  greater  Chicago  area.    These  groups  .were: 

Biology  ^fiUkk^ess  and  Economics 

^  Chemistry    ^  Psychology  and  -Education 

Physics        '  ^  Sociology. 
Mathematics  \' 

Conspicuous  by  its  absence  is  Computer  Science  which  was  highly  developed 
at  IIT  but  nonexistent  at  the  other  participating  institutions  except  for  a 
data  processing  program  in  oqe  Of  the  non-participating  Chicago  City  College 
c^puses.  ^  *         ^,  • 

During  the  first  year,  'Psychology,  Education,  and  Sociology  were  lumped 
into  one  study  group.    At  the  end  of  the  first  year.  Sociology  was  made  a 
separate  group  while  Psychology  and  Education  were  left  together.  ^ 

6roup.  Leaders'  and  Administrative  Reports  * 

At  the  end  of  the  project,  reports  were  received  from  each  of  the  seven 
group  leaders  and  from  the  Project  Manager  (who  managed  to  acquire  a  Ph.D. 
in  Market^ing  at  Northwestern  University  during  the  project  period).  These 
repor^ts  were  edited  by_the  Project  Director  and  the  Project  Manager  and 
constitute  the  last  section  of  this  report.    Each  contains  an  account  of 
what  happened  in  each  discipline  together  with  observations  and  helpful 
hints,  some  of  which  transcend  the  particular  discipline.  Accofdingly, 
regardless  of  the' reader's  discipline  orientation,  a^  the  accounts  will 
contain  something  of  interest. 
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Interaediat^  project  reports  produced  earlier  in  the  project  have  been 
collected  as  Appendices  and  they  contain  much  useful  information  regarding 
project  structure,  project  start-up,  bibliographies,  program  exchange 
listings,  financial  analysis, .  etc.  .  •  ^ 

A  brief  analysis  of  the  computer  in  higher  ^education  was  m'a4&  by  the  Project 
Director  (Attachment  1  to  this  Introduction  and  Stnnmary). presented  as  one 
of  twenty  "provocative"  statements  at  the  RAND  Invi^tational  Conference  on 
"Computers  in  I^;istruction:    Their  Future  for  Hi'gher  Education"  held  in 
Beverly  Hills,  Calif ornia, , on  October  1,  2,  3,  1970.    That  analysis  is  based  • 
in  part  on  the  many  experiences  of  the  Project  Director  both  in  organizing 
and  in  conducting  the  tftipee-year  project.  > 

.Project  and  Project-Inspired  Conferences 

In  addition  tc  the  several  "in-house"  conferences , on  curriculum  development 
which  included  many  visitors  and  participants  from  outside  the  project,  two 
additional  conferences  of  national  scope  were  a  direct  consequence  of  this 
project.  ^  '  .        '  ,        ^  f 

The  first  such  was  the  week-Ion^  conference  "Computers  in  Undergraduate 
Science  Education:    Physics  and  Mathematics"  held  at  IIT  on  i^gust  17-21,  ^ 
1970.    The  conference  was  cosponsbred  by  IIT,  The  Commission  on  College 
Physics,  and  funded  by  the  Office  for  Computing  Activities  of  the  NSF.  ^ 
The  IIT  project  group  leader  in  physics  was  codirector  of  the  conference 
and  proceedings  were  generated.    There  was  considerable  emphasis  on  , 
computer  assisted  instruction. 
'I 

The  second  national  conference  w^s  the  week-long  conference  "Computers  in 
Chemical  Education  and  Research"  held  at  Northern  Illinois  University  on 
July  19-23,  1971.    The  conference  was  cosponsored\by  NIU^  American  Chemical 
Society,  National  Resear9h  Council,  and  funded  by  the  Chemistry  Section* 
and  Office  of  Computing  Activities  of  the  NSF.    The  Project  Director 
organized  the  conference  and  proceedings  were  g^n^rated.    ine  them^  ol  thV 
conference  emphasized'  th^  fact  that  the  computer  is  affecting  how  chemists 
do  chemistry,  correspondingly there  need  to  be^establisl^d  channels  of 
communication  between  those  ch^emists.at  the  frontiers  wh6  are  developing 
innovative  approaches  to  chemical  problems  tapping  the  potential  of  the 
computer,  and  all  other  chemists,  particularly  those  responsible  for 
chemistry  curricula.    The  idea  here  is  that  whereas  some  existing  chemical  * 
^problem-solving  methods  can  be  speeded  up  u6ing  the  computer,  such  a  v^ry 
limited  approach  falls  far  short  of  realizing  the  potential  of  the  informa-  . 
tidn  processing  machine.  ... 
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Participating  Campuses'  Post  Project  Conft)uting 

A  few  comments  are  in  order  regarding  what  the  participating  campuses* are 
doing  now.*. 
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For  example,  the  Chicago  City  College  System  has  created  an  IIT-Faculty 
Training  and  Curriculum  Development-like  project  at*  Wright  College,  one  ^  * 
of  the  four  City  College  campuses  which  did  not  participate  in  th'e  IIX- 
dentered  project.  Six  faculty  m^mb^ers  from  six  disciplines  have  been 
released  part-time  from  their  teaching  duties  for  this  purpose  and  they 
are  being  guided  by  a  Wright  professor  with  considera*ble  experience  and 
many  contacts  in  the  "Computers  in  Higher  Education"  milieu,. 

Quincy  CollegeJ ^ which  is  geographically  isolated,  has  replaced  its  tele- 
typewriters with  a  programmable  terminal  and  is  using  both  the  University 
of  Iowa  IBM  360  and  the  IIT'UNIVAC  1108.   'This  constitutes  a  conceptual 
breakthrough  in  that  the  faculty  are  free  to  Shop  around  having  different 
menus  to  choose  f ronj.  •  * 

Ripon  College  has  pj^rchased  a  mini-computer  which  siypports  teletypewriters 
in  time-sharing  mode.  '  In  addition,  there  is  "an^ attempt  underway  to  adapt 
that-  system  so  it  can  operate  as  a  medium  speed  programmable  terminal  to* 
accpfis  other,  cbmputers  elsewhere  such  as  at  Madison  and  Milwaukee,  Wisconsin, 
and  in  the  greater  Chicago  area.    Finally,  Ripon  is  creating  a  small  network 
of  its  own  involving  local  high  schools  including  faculty  training. 

K 

'  The  remaining  schools  have  either  arranged  for  their  own  local  mini-computers 
orvare  continuing  to  get  service  via  telephoYie  lines  from  remote  computers. 

In  every  case  there  has  been  a  significant  impact  on  the  academic  program 
in  an  amazing  variety  o%  ways.    The  prihcipal  inhibiting  factor  operating 
at  all  participating  campuses  continues  to  be  lack  of  faculty  training  and 
interest. 

Greater  Chicago  Area  Computer  Consortium 

The  State  of  Illinois  Board  of  Hig^r  Education  recognized  the  potential  of 
individual  institution  pjogram-enhanceirient  through  cooperation  with  neighbor- 
ing institutions  and  thus  stimulated  the  formatioa  of  a  committee  represent- 
ing the  thirty-odd  insititutions  of  higher  learning  in  the  greater  Chicago  ^ 
area.    The  Project ^Director  chaired  the  subcommittee  charged  with  cooperation 
involving  computers'.    That  subcommittee's  report,  subsequently  approved  by, 
the  Board  of  Higher  Education,  outlined  a  cooperative  prOgraip  evolving 
through  three  stages  based  essentially  On  the  experience  derived  from  this 
project. 

Computational  Laboratory 

'  n 

Although  the  principal  problem  continues  to  be  training  of  faculty  on  the 
impact  of  the  computer  on  their  discipline  content  (as  well  as  an  awareness 
of  *  the  current  state  of  the  embryonic  computer-assisted  pedagogy) j  neverthe- 
less, the  secondary  question  of  cSmputer  support  for  education  has  not  yet 
been  adequately  atidressed  even  within  today's  standard  computer  techno l>ogy. 
Accordingly,  a  second  outgrowth  of  the  IIT-centered  regional  network  has 
been  the  design  of  the  IIT  Computational  Laboratory  (Attachment  2  to  this 
Introduct^ion  and  Summary)  which  serves  in  two  modes: 
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1.  Interactive  compilers,  file 'system,  and  a  CAI  system  from  a 
number  of  'cRT  terminals  with  graphics  capability  as  a  locaJL  / 
computing  facility. 

2.  Programmable  medium^speed  terminal  so  the  instructor ^can 
collect  student  inputs  and  transmit  them  to  other  computers 

^      '    elsewhere  for  batch  processing  and  subsequently  return 

transmission  of  results  in  order  to  access  computer  programs 
no  matter  where. they  are  running, 

A  comprehensive  overview  of  a*vailable  standard  hardware  has  revealed  an 
extreme(jliy  powerful  system  (pomparable  to  ap  IBM  360/40  to  50)  at  surprising 
low  cost.    For  Bxample, sgiven  the  intermediate  16  terminal  configuration 
and  a  five-year  lifetime,  -a  student  contact  hour  at  a  terminal  costs  two 
dollars.  .(The  CRT  terminals  cost  less  than  keypunched.)    However,  since 
the  system  is  designee  to  make  easy  and  flexible  use  of  computer  utilities 
elsewhere,  its  usefifl  life  in  this  rapidly  changing  computer  technology 
actually  extends  beyond  five  years. 

There  exist  two  systems  which,  are  receiving  Rational  attention  ^is  the  basis 
.for  a  five-year  demonstrational  project  in  CAI,  namely,- the  PLATO  system 
based  on  thousands  of  non-standard  ifihovative  terminal?  accessing  a  large 
fcomputer  supporting  many , academic  ins'titutions  at  all  educational  levels, 
and  the  TICCIT  system  based  on  a  local  stand-al,one  jnini-computer  supporting, 
video-cdmpatibl^  terminals  designed  with  the  lower  educational  levels  in 
mind.    Iri'fJBrtj^  the  IIT  Computational  Laboratory  coristjltUtes  the  higher 
educational  counterpart  to  the  TICCIT  system  in  apposition  to  the  PLATO 
system.    However,  it  transcends  both  systems  in  that  an  essential  part  of 
its  design  is  convenient  acfiess  to  application  computer  program^,  no  matter 
in  what  language  they  were  written  and  no  matter  on  whose  compute'^  connected 
to  the  country's  communication  system  they  are  running. 

Recommendation 

Finally,  if  one  had  to  distill  out  of  all  this  experience  the  single  most 
important  recommendation  it  would  be  the  following: 

Innovators  in  specific  disciplines  doing  research  are  discovering  how  lilan's 
problem- solving  and  decision-making  power  is  being  extended  with  the  advent 
of  the  information  processing  machine.    Stron^^ encouragement  needs  to  be 
given  to  them  to  conduct  faculty  training  institutes  and  to  adapt  their 
discipline-oriented  computer-based  innovations  for  use  as  part  -of  high 
school,  college,  and  university*  academic  programs. 


June  30, 
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THE    IIT'COMPUJATIOiMAL  LA.BORAJQRY 

Peter  G.  Lykos 
Presently  on  leave  at  the  Office  Contpu^ting^ Activities, 
National  Science  Foundation 

Robert  B.K,  Dewar 
Illinois  Institute  of  Technology 

\ 

m  OVERVIEW 

.The .compljtesr  is  beginning  to  find  an  Important  place  jn  college 
education*    The  influence  is  not  restricted  to  the  hard  sciences, 
but  extends  throughout  the  curriculum*    However,  as  recent  articles 
indicate  (Educational  Media:  November  1969,  February  1970,  September 
1970:  Proceeding?  of  Conference  on  Computers  in  Instruction:  Their 
Future  for  Higher  Education,  RAND,  October  1,2,3,  1970,  Beverly  Hills, 
California).   The  use  of  computers  has  been  on-only  a  token  basis, 
^even  at  most^of  the  large  universities-    This  is  due  to  two  causes: 
'  1)  The  faculty  h&ve  not  yet  updated  themselves  and  the  curricula  they 
teach  in/ order  to  Incorporate  computer-based  problem-solving  tech- 
iques:  and  2)  Even  -if  they  had  done  so^  the  computer  systems  currently 
available  would  not  support  broad-brush  use  of  the  computer  by 
students  in  a  cost/effective  fashion* 

It  seems  essential  to  incorporate  the  use  of  the  computer  into 
tbe  curriculum  in  an  orderly  scheduled  manner.    It  is  quite  un5;atis- 
factory  to  attempt  to  extend  the  typical  current  systems  in  which 
students  contend  for  a  ^mall  number  of  key-punch  machines  or  ter^iinals-^ 
in  their  spare  time. 

We  have  therefore  devised  the  notion  of  a  Computational  Laboratory, 
consisting  of  a  classroom  of  quiet  CRT  terminals,  into  wh-idi, students 
are  scheduled  as  part  of  their  standard  program  of  study,  whis  lab- 
oratory would  be  used  both  for  computer-based. instruction  and  problem 
solving  sessions  in  a  variety  of  disciplines. 

The  availability  of  low  cost  'mini -computer  hardware  makes  the  estab- 
lishment of  suth  a  facility  practical  at  relatively  low  cost..   The  mini 
computer  system  can , duplicate  most  functions  of  a  large  scale  system 
with  respect  to  the  requirements  for  student  computing. 

An  additional  and  important  degree  of  freedom      the  availability 
of  communications  hardware  which  allows  the  use  of^^a  variety  of  large 
scale  systems,  remote  from  this  installation,  to  handle  more  compli- 
cated problems.    This  latter  appro^h  is  of  particular  irrjportance, 
since  the  possibility  of  using- application  programs  on  the  computer 
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'"system  fdr  which  they  were. designed" allows  thei  establishment  of  a 
direct  link  between  "the  studer\t  in  the  Computational  Laboratory  and 
the  frontiers  of  research  in  universities  across  the  nation.  It 
should  also  be  noted  that  the  concept  of  an  onsite  facility  communi- 
cating with  a  variety  of  lar^systems,  at  the  convenience  of  the 
user,  is  pch  more- attractive        the  more  staridard  arrangement  of 
a  small  nlachine"  which 'is  "tied"  .to  one  large  central  computer. 

The  necess'ary  software  partially  exists  ijj  a  variety  of  systeljis. 
However,  no  system  currently  available  provides  the  necessary  software 
in  an- integratfed  package  designed  specif^ically  for  this  purpose.  It 
is. the  intention  of  IIT  to  generate  the  required  software  system, 
with  a  view  to  duplicating  the  Laboratory  in  a  variety  of  colleges 
and  universities. 

Another  important  factor  with  regard  to'  the  use  of  this  systfem  in 
a  small  college,  or  a  high  school,  is  that  it  would  be  caoable  of 
supporting  the  administrative  requirements  if  suitable  software  were 
available.   Thus  funds  could  be  combined  to  put  the  System  in  reach 
of  many  small  cbl leges. 

No  fixed  cost  limits  have  been  established  for  individual  compo- 
nents.   However,  a  target  cost  of  $200,000  for  an  intermediate  size 
16  terminal  configuration  has  been^ established  for  the  entire  system 
(all  items  are  to  be  purchased  Outright).    In  general,  cost  "is  aq 
important  factor,  since  the  attraction  of  this  system  for  small 
educational  units  lies  in  its  flexible  facili.ties  and  low  package 
price.   The  range  of  purchase  costs  is  from  about  $125,000  for  an 
eight  terminal  system  to  about  $375,000  for  a  forty-eight. terminal  • 
system. 

It  is  difficult  to  assess  the  total  impact  of  such  a  system  on 
secondary  schools  and  on  higher  education.    However,  the  following  may 
be  noted:  *  .  ^ 

1,  A  recent  survey  by  the  American  Institute  for  Research  of  the 
23,000public  high  schools  in  the  .USA  reveals  that  13%  are  al- 
ready using  the  computer  as  part  of  the. educational  profess. 
Presumably  a  similar  percentage  applies /to  the  8,000  private 
•high  schools. 

2.  A  recent  study  by  the  RAND  Corporation  of  instructional  uses 

'  ^    of  the  computer  in  the  2,600  colleges  and  universities. of  the 
USA  reveals  that  over  half  make  significant  use  Of  the  computfer 
as  part  of  the  instructional  process  and",  furthermore,  program- 
mable medium  speed  terminals  (one  of  the.  modes  of  operation  of 
the  IIT.  Computational  Laboratory)  increasingly  vrill  be  used  to 
.  ,en?ible  college  students  to  access  a  variety  of  computer  services 
■  as  a  regular  part  of  their  academic  training.   •  ■ 
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3.  A  recent  development  of  Advanced  Research  Projects  Agency  has 
spanned  the  USA  with  a^m^dium  speed  data  transmission  network 
dedicated  to  interfacing  computers  with*e?jch  other  or  with 

data  terminals.    EDUCOf^' is  examining  thev  very  much  underutilized 
'  ARPA  Net  with  a  view  tp  bringing  it  into  the  service  of  higher 
education  on  .a  national  scale.    ,  " 

4.  The  High-Energy  Physics  Advisory  Panel  to  the  Atpmic  Energy 
Commission,  Qivision  of  Research,  recommendation  that  -the 
CDC  7600  recently  installed  at  the  Lawrenge  Radiation  Labora- 
tory be  operated  as^  a  national  facility,  has  been  accepted  and 
scientific  users  outside  of  .the  b>igh-energy  physics  community 

,  will  also  be  accommodated  on  a  time  available  basis.    The  system 
is  accessible  on  a  Remote  Batch  Service  basis  with  large  on-site 
storage  facilities  available.    As  of  23  April  71.,  federal  agen- 
cies and  cost-type  government  contractors  are  recharged  at  the 
internal  rate  oT  $155  per  hour  fbr  6600  tttne/  Rates  for  the 
*        7600  system  ar^*  not  yet  ^determined,  but  will  be  adjusted  so 
that  costs  for  equivalent  jobs  will  be  less  on  the  7600  than 
.    on  the  6600.  ■    •  ' 

OUTLINE  OF  THE  PLANS 

Introduction  ^'  ^  . 

The  IIT  Computational  Laboratory  is  an  integr^ited  hardware/software 
package  with  local  and  remote  processing  capability  which  is  intended 
to, meet  a  broad  spectrum  of  educationa*]  needs  at  the  college,  as  well 
as  the  university,  and  to  an  extent,  the  secondary  school  levels. 

The  design  is  based  around  a  j:arefully  selected  mini-coinputer 
system  and  derives  from  the  extensive  experience  of  IIT  in  the  field 
of  educational  use  of  computers.    Among  the  important  considerations 
are  the  following: 

(a)  It  is  necessary^  for  psychological  and  other  reasons,  to  have 
local  computing* capability. 

(b)  A  wealth  of  applications  programs  is  currently 'accessible  via 
telephone  lines,  given  a  system  which  can  communicate  with  a 

*  variety  of  large  machines. 

(c)  Computing  in  the  educational  environment  needs  to  be  a 
scheduled  activity  comparable  with  the  laboratory  sessions' 
in  chemijbm^  and  physics  courses. 
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(d)  A  viable  system  must  have  administrative  data .processing 
capabilities  since  there  is  an  existing  colmiitment  here 
in  terms  of  resources  and  personnel. 

1  Erom  these  and  other  considerations,  a  comprehensive  system  has 
been  designed.   The  hardware  is  currently  available  and  the  software  . 
will  be  implemented  at  I IT  over  a  two  year  period. 


]]iejjardwarfr ^ ,  ■  ^  "  ^ 

The  hardware  system  consists  «f  the  following:  * 

36K  16  bit  word  mini-computer 
900  nsec  cycle  time  (TEMPO  1)^ 

2  2314  type  disk  unks  .  \ 

capacity  28,000,000  16  bit  words 

2  tape  units  30  ips.      .  .  '  . 

1  electrostatic  printer 

(700  to  5000  1pm  and  graphics  capability) 

4  synchronous  communication  lines 


The  totaVcost  of  "this  system  is  approximately  $200,000  with  annual 
maintenance  cost  of  approximately  $25,000.    The  useful  lifetime  of  the 
system  is  at  least  5  years.    Thus  the  pro-rate  annual  cost  of  the 
system  is  a  reasonable  percentage  of  the  annual  budget  of  a  typical 
liberal  arts  college. 

Tne  Software      '  ^  • 

I.   The  Administrative  Data-Processing. Package     .  ' 

As  previously  stated,  an  important  component  of  the  Computational 
Laboratory  is  th6  administrative  data  processing  package.    From  a 
hardware  point  of  view,  -the  proposed  configuration  substantia. .y  . 
exceeds  the  power  of  ^  360/30  with-  65K  memory.   Thus,  given  appropriate 
software,  its  capabilities  for  data  processing  use  should  be  adequate 
for  many  purposes, -particularly  in  view  of  tik  fact  that  runs  requiring 


1  card  reader  (400  cpm) 


t 


1  DCT-500  30  ^ps  impact  printer, 

16  programmable  CRT  terminals  with  point  graphics  capability 
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a  large  scale  processor,  for  example  modular  scheduling  applications, 
could  be  processed  remotely •  • 

V^^e  ADP  package  consists  of  four  compd(ients:  \ 

(a)  An  RPG  processor  "for  simple  fileNwdHltenance  apolications- 

(b)  A  small  COBOL  processor  downward  com|)atible  from  ANSI 
,  standards* 

(c)  A  sort  package  for  disk  and  tape  files • 

[dY  An  indexed  sequential  file  management  systeip^  The. 
logical  organization  provided  by^  this  type  of  system 
is  ide^l  for  many  applications  in  which  permanent  files 
are  maintained  on  disk. 

V; 

*'TI,    The  Desk  Calculator      .  ... 

The  desk  calculator  simulator  gives  the  terminals  the  capability 
of  being  used  as  sophisticated  electronic  calculators  with  program-  ' 
mi ng' capability.    This  might,  for  e^^mple,  be  used  to  provide  a 
highly  effective  substitute  for  the  classroom  of  calculators  sometimes 
used  in  the  teaching  of  statistics. 

An  iTiteresting  feature  of  the  calculator  is  that  it  is  implemented 
entirely  within  the  programmable  terminals.   This  ensures  minimum  load 
on  the  processor  and" gives  guaranteed  instantaneous  response.    To  give 
an  idea  of  the  level  of  sophistication  of  this  package,  it  *is  possible 
to  store  the  program  and  data  to  solve  a  4  x  4  jystem  of  simultaneous 
equations  by  Gaussian  elimination.     ►     '  »^       "         ^  ^ 

III.    The  File  System  "  v  ' 

Many  parts  of  the  system  are  based  on  the  concept  of  easily 
manipulated  files  which  reside  on  the  disk  storage. 

The  programmable  terminals  are  usecl  to  provide  an  extremely  powerful 
editing,  system  which  is  very  easy  to  use.    Basically,  the  screen  acts 
as  a  window  scanning  .through  the  file,  and  the  information  is  modified 
on  the  surface  of  the  CRT  using  a  range  of  editin?  functions,  includinq 
character  insert/ delete,  line  insert/ delete,  field  skip/duolicate,  etc. 

This  'frame  processina'  approach  to  feditinq  is  powerful,  easy  to  use 
and  requires  very  little  attention  from  the  CPU  thus  assuring  raoid 
resDonse.    A  similar  system,  NUROS,  is  in  use  at  the  University  of 
.'iebraska  using  2260  terminals  on  a  'large  IBM  360/65. 
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Suitable  system  commands  allow  fiTes.to  ,be  created,  ccTp'ied",  updated, 
extended,  saved  on  tape,  pr*inted,  etc.  ^  -  . 

IV.  Remote  Access  System 

The  Remote  Access  System  provides  comprehensive  hardware  ^and  soft- 
ware support  for  accessing, a  variety  of  large  scale  remote  computers. 
A  processor  language  which  is  being  designed  allows  creation, of  a 
remote  entry  module  (REM).    The  REM  basically  performs  two  functions: 

(a)  An  input  file  (from  Disk)  is  converted  into  suitable  form  for 
use  remotely  (by^ reformatting.,  adding  control  cards,  translating 
charafcter  sets,  etc.)  •  • 

(b)  The  received  output  is  converted  to  a  usable  Ibca^l  file  (by 
*  removing  extraneous  system  messages,  etc.). 

V.  Local  Conversational  Processors\^ 

A  variety  of  "compile  and  go'  processors  will  be  implemented. 
These' will  either  run  conversationally,  with  input  and  output  directly 
to  the  terminal,  or'the  input  and  output  can  be  associated  with 
prenamed  files.    In  the  latter  case,  the  output  could  be  printed  on 
the  hi.gh  speed  printer. 

.  Initially,- the  design  i,nc'ludes  two  such  prodessors: 

(a)  IITRAN,  the  student  oriented  language  designed  and 
implemented  at  TIT.  •  f 

(b)  '^NOBOL-4,  the  string  .processing  language  developed  at 

.  ■■■   Bell  Telephone  Lat?oratories.  '  A  recent  implementation  ■\ 
of  SNOBOL-4,  called  SPITBQL,  for  the  360,  done  at  IIT, 
•shows  this  language. to  be  practical  on  a  machine  of 
this  size. 
V  ' 

•  It  is  anticipated  that  other  processors  in  this  class  will  be 
implemented  as  various  students  become  involved  with  the  project. 
Based  on  current  interest,  these  might  include. BASIC,  FORTRAN,  APL, 
LISP,  ALGOL-68. 

VI.  The  CAI  Package 

A  language  for  applications  in  computer  assisted  instructipn  will 
'  h?.  part  of  the  system'.    Here,  extensive  use  of  the  large  capacity 
disk  storage  will  be  made  for  staring  programs  andiessons.  .Also, 
the  capability  of  the  programmable  terminals  will  be  used  to  provide 
frame  oriented  display  including  graphics. 
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The  exact  form  of  the  CAI  language  has  not  be  determined.  A 
requirement  is  that  it  be  simple  for  non-computer  oriented  faculty 
members  to  devise  and  implement  course  materials.   A  model  of  the 
type  of  language  under  consideration  may  be  found  in  the  .UNIVAC 
COPI  language  which  is  in  wide  use  in  vJuried  applications. 

HARDWARE  CONFIGUATIONS  AND  APPROXIMATE:  PRICES  *  (May  1,  1971) 


1. 

8  terminals,  page  printer,  1  disk,  1  taoe,  24K  core.  * 

'  $122^,000 
$161 ,000 

2. 

12  terminals,  page  printe»>>  1  disk,  1  tape,  Gould 
printer/plotter,  32K  core                       *  ^ 

+3. 

16  terminals,  page  printer,  2  disks,  2  tapes,  Gould 
printer/plotter,  36K  core 

$205,000 

4. 

24  terminals,*  page  printer,  2  disks,  2  tapes,  Gould 
printer/plotter,  40K  core 

$237,000 

5. 

32  terminals,  page  printer,  2^ disks,  2  tapes,  Gould 
printer/plotter,  48K  core      '  , 

$268,000 

6. 

48  terminals,  page  printer,  3  disks,  2  tapes,  Gould 
printer/plotter,  64K  core 

$354,000 

\ 


*   These  prices  ii'^Uide  hardware  only,  software  adds  about  10%  (estimated) 

+  The  configuration  illustrated  on  the  following  page  -and  all  subsequent 
pages  refef^  to  this  configuration. 


7-29 


HARDWARE  CONFIGURATION 


INCOTERM 
.  "[PU 
4K  Bytes 


^  T"  X 

I NCOTERM  Programmable  CRT ' s 
960..Cliarajgter  Display  and  Point  Graphics 
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/  '      7-30  ■  .  ' 

TEMPO  1  CPU  &  MEMORY  CHARACTERISTICS 

Memory:  36K  16  bit  words  {72K  bytes),  cycle  time  900nsec. 

# 

Register:    .   16  general  purpose  16  bit  xregs 

10  non-progranmable  special  purpose  registers 

'  Instructions:  full  set  of  memory  to. register,  and  register  to 

register  instructions  including  full  set  of  logical 
^     '  and  shif^t  instructions 

Addressing:     13  modes,  including  double  indexing,  »ind|rect  reference 
and  table  search 


Timing:  load  from  memory  1.8  usee. 

'  r'egister-register  0.9  usee.  v. 

mutiply/divide  6.0  usee.  ' 

Input-Output;  maximum  total  rate  700,000  * 
16  bit  words  per  second 
12  nested,  maskable,  interrupt  levels 

TERMINAL  SUBSYSTEM  '  '  '    -  / 

Type:  INCOTERM  SPD  10/20  Dual 

Display:         15  x  64  »  960  characters  or  60  x  126  points'  (or  mixture) 

Two  displays  are  controlled  by  one  SPD  10/20  TPU.    The  TPU  characteris^ticlfe 
are  as  fallows: 

Program  •  '  <  .  . 

Registers:  3^ 

* 

Instruction  Set:  Includes  arithmetic,  logical,  branch,  and  I/O  instructions. 
Speed:  Memory  1.6  usec.  Load  Instruction  3.2  usee. 

Core:  2K  16  bit  words  (IK  woVds  used  for  refresh) 
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CARD  READER  SUBSYSTEM 


Format:  Standard  80  column  cards,  Hollerith  or  column  binary 

Speed:  400  c.p.m. 

Capacity 

(hi^pper):  500  cards 


pm^ER  SUBSYSTEM 
Type: 
Speed: 


Resolution: 


Character 
Sets: 


Paper  Size: 

Paper  Type: 
DISK  SUBSYSTEM 
Drives: 
Format: 
'  Accejss : 

Speed : 
Capacity: 

'  Pa^ks: 


Electrolytic  (dot  matrix)     *      .  - 

4,800  l.p.m.    (Character  lines) 
40,000  l.p.m.  (dot  lirtesj 

/./ 

80  dots  per  inch  (full  graphical  capability,  includin 
special  forms  generation) 


Hardware  fonts  for  ASCII  cbarc  in  S  x  7  or  8  x  10 
matrix  (horizontal  or  vertical),  ' 
Software  can  print  any  chars  using  a  program 
defined  dot  matrix  set. 

11  inch  (roll  or  fanfold) 

J132  chars  t\prizpntal  with  5x7  font) 

Special  coated  paper  (cost  is  less  than  2^  per  page) 


Two  -  simultaheous  seek  capability^ 

IBM  compatible  fixed  length  sectors  (346  bytes) 

35  msec,  average  head  positioning. 
12.5  msec,  average  rotational  delay 

156,000   16  bit  words  per  sec9nd 

11,200,000   16  bit  words  per  drive 
(totail  capacity  45,000,000  chars) 

Standard  IBM  2314  type  (removable) 
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TAPE  SUBSYSTEM 
Drives: , 
Format:  " 
Density: 
Speed : 
Capacity: 


two  -  simultaneous  rewind  capability  ' 
IBM  compatible  9-triick 

SeOO  BPI  ,  .  ^ 

25  i.p.s..  =  30 ,€00  chars  per  second' 

2,400  ft.  standard  reels 

COMMUNICATIONS  SUBSYSTEM 

Type:  Synchronous        "  .  • 

Speed:  ■  2K  baud  (phone  line)  "         ,  " 

A  variety-of  channels  provide  sufficient  flexibility  to  conmunlcate  with 
a  wide  range  of  large  scale  computers  including  IBM  360,  UNIVAC  1108, 
CDC  6400,  6600,  7600.        '  -  ^ 
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The  Eastern  Michigan  University  Center 
for  the 

Exchange  of 'Chemistry  Computer  Pro(|rams ; 
EMU-CECCP 


by 


Ronald  W.  Collins 
Chemistry  Department 
^a^tem  Michigan, University  / 
Ypsilanti,  Michigan 


4^7 


-Many  chemistry  departments  are  now  utilizing  some  form  of 
computer-aided  instruction  in  their  coursework.    This  use  ranges 

•  from  spphisticated  interactive  computer-assisted  instruction  (CAI) 
to  the  simplest  form  of  non-interactive  computer  applications 
(NICA),  namely  data  reduction.    Regardless,  of  .the  exact  nature  of  the 
computer  usage,  however,  there  is  an  emerging  need^'r.  many  de- 
partments for  a  library  of  computer  programs  to  implement  the-lr 
planned  computer-aided  instruction.    Most  chemistry  instructors  • 
have  been  meeting  %\{\%  need  by  writing  their  own  computer  pro- 
grams, specifically  for  their  own  pedagogical  purpose  and  designed 
exclusively  for  their  own  computer  system.    To  d^teconmunf cation 
within  the  academic  chemistry  community  on  such  efforts  has  been  I 

•  relatively  poor.    This  has  lead  to  considerable  duplication  of 
effort  in  the  writing  of  essentially  identical  programs  for 
basically  identical  purposes  ait  different  institutions.  Admit- 
tedly  the  portability  of  computer  programf^is  often  hampered  by 
systems  variations  among  different  machines;  however,  considerable 
timfe  can  still  be  saved  by  eliminating  repetition  in  developing 
algorithms  for  given  chemistry  applications,    tn  an  attempt  to 
provide  a  service  which  will  help  to  eliminate  some  of  this  ► 
needless  duplication  in  programming,  the  off-line  Eastern  Michigan 

.  University  Center  for  the  Exchange  of  Chemistry  Computer  P^'oqrams 
(EMU-CECrp)  has  been  established.  '  •  ' 

*The  EMU-CECCP  was  originated  in  1970  with  emphasis  strictly  ■ 
on  computer  programs  of  an  educational  nature;  i.e.,  programs 
which  could  readily  be  used  In  chemistry  courses,  rather  than 
highly  specialized  research  programs.    The  rationale  for  creating  > 
the  EMU-CECCP  goes  beyond[,  however,'  the  simple  expediency  of  ex- 
changing proven  documented  programs,  to  include  also  the  hope  that,  ' 

vthe  center  will  serve  to  cVeate  an  awareness, of  professional  ex- 
pertise in  given  areas  of  computer  applf cations  to  chemical  probr 
lems.    In  addition  it  is  anticipated  that  the  EMU-CECCP  can,  by 
hosting  periodic  conferences ,  serve  to  promote  the  exchange  of 
ideas  among  chemists  on  the  role  of  computers  in' chemistry  edu- 
cation- The  EMU-CECCP  is  at  present  internally  financed  by 
EastemV^ichigan  Univers'ity,  and"  operated  via  the  part-ffme  effprts  • 
of  several  EMU  chemistry  faculty  and- graduate  students.  The 
geoc(raphical  scope  of  the  organization  has  been  primarily  the  - 
state  of  Michigan  along  with'several  neighboring  Midwestern  states. 

The  EMU-CECCP  does  not  represent  a  unique  concept;  i.e.,  the 
basic  idea  of  a  program  exchange  is  not  original  to  this  organ- 
ization.   Consequently,  the  strength  of  the  EMU-CECCP  does  not  lie  • 
in  Its  basic  originality,  but  rather  in  the  heavy  emphasis  being  - 
placed  on  effective  .indexing  and  documenting  of  all  programs 
cental nec  in  the  library.    Each  EMU-CECCP  program  is  thoroughly 
describee      a  "User  Description",  which  is  an  8-10  page  document  - 
covering  the  theory  and  use  of  that  particular  program.    The  exact  * 
format  for  the  User  Description  varies  from  one  program  to  an- 
other but  the  essential  contents  remain  the  same.    Included,  are  ' 
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a  flow  diagram,  symbol  'chart,  4:omplete  program  listing,  instructions- 
for  data  card  preparation,  and  a  sample  problem  to  check  the  program 

execution.         ^  '  ^.  ^ 

The  EMU-CECCP  disseminates  information  regarding  its  inventory 
of  programs-  by  issuing  semi-annual,  numerical  andVjceyword  indexes. 
Each  program  accepted  for  inclusion  in  the  EMU-CEijCP  library  is 
cataloged  under  both  a  program 'name  and  an  assigned  number,  and 
its  function  is  described  by  not  more  than  five  keywords.  These 
"keywords"  can  be  either -English  words,  chemical  symbols,  or  appro- 
priate mathematical  expressions.  Both  an  alphabetical  Wsyword  ind^x 
and  a  numerfcal  11st  of  all  available  programs  is  then  published; 
To  date  these  indexes  have  been  supplied  gratis  tb  al]  63  Insti- 
tutions of  higher  learning  Ia  the  state  of  Michiganv  plus  other 
interested  individual's  who  have  been  referred  to  mir  organization. 
The  suggested  sequence  for  efficient  use  of  these  index  qiaterials 
Is  as! follows:  ,  ,v  • 

'1.  Consult  the  alphabetical  keyword  index  (Figure  1.) 
-'     '      for  terms  indicative  of  the  apptlicatioij  you  have 
in  raindi^-note  the  program  number(s)  corresponding 

to  these  terms.  ,  «\ 

■  2.  Consult  the  Numerical  list  of  nrograms' (Figure  2) 

under  the  appropriate  number (s)  to  bbtain  more 

Information  about  those  programs  which  appear  to 

.  be  of  Interest  to  you.   This  list  describes  the 

^  '  input  and  output,  as.  well  a.s  the  progrannlng 

language  for  each  Computer  program. 

3.  Based  on  this  additional  information,  if  a  given 

program(s)  is  jpf  definite  interest  to-yoa,  request  " 

from  the  EMU-CECCP  the  user  description  for  that 

.program(s).   A  form -for  such  requests  is  supplied 

for  the  convenience  of  all  EMU-CECCP  subscribers. 

At  present  the  function  of  the  EMU-CECdP  terminktes  at  this  point; 

i.e.,,  the  organization  is  not  supplying  duplicate  source  decks  for 

programs  (requests  can  be  direq^d  to, the  program  author',  but  not 

to  the  EMU-CECCP)  nor  is  it  attempting  to  become  involved. tn  * 

-problems  of  systems  capab^l'ity.   Tiie  EMU-CECCP,  does^  however,  serve 

as  many  users  as  possible  by  including  in  our.. library  essentially  . 

duplicate  programs  written  in-different  programming  languages. 

-    '  The  organizational  format  6^  the  EMU-CECCP  seems  to  be  both 

efficient  an$  reasonably  effective  in  meeting  the. needs  of  our  ^ 

various  users;  however,  several  problenls  have  arisen.  First, 

program  authors  iliustWri'^e  the  user  descriptions  for  their  ow 

programs  rather  than  depending  on  the  EMU-CECCP  staff  to  prepare 

these  documents.  "  Originally  this  v/as  not  the  plan,  but  it  has  been 

shown  rather  conclusively  that  an  %ufhor  cap  document  his  own 

program  ir.  considerably  less, time,  thereby  facilitating  more  rapid 

accessibility  to  his  programs  via  their  earli.er  inclusion  in  the 

HMU-CECCP  index.   A  second  probl-em  has  beeO  communicating  with 


potential  program  donors  who  nrtght  wish  to  make  their  computer 
programs  available  to  other  chemists  but  who  are  not  aware  of  the 

EMU-CECCP.   For  this  reason  the  l?MU-CECCPJs  being  extensively 
PMbllcized  via  national  and  regional  meetings  of  the  American 
-Cheraica'i  Society  as  well  as  through  specialized  computer-oriented 
conferences. 

Regiirding  other  future  plans,  the  EMU-CECCP  will  continue  to 
be  only  an  off-line  source  of  computer  programs  for  chemistry 
edutatlon;  i.e.,  there  are  no  plans,  to  provide  user  technology 
.in  the  form  of  on-line  access  tS  bur^rogram  library  via  a 
time-shared  network  operated  tokJiJe  EMU  campus.   It  is  our 
contention  that  efficient,  ecdhomiciA  time-sharing,  is  available 
through  a  variety  of  sources  rand  that  the  efforts  of  the  EMU-CECCP 
staff  are  better  directed  toward  developirtg/and.  operating  the  best 
possH)le  off-line  educational  program  exchange.  A  review  of  our 
current  library  holdings  does  reveal,  however,  that  the  majority 
of  the' EMU-CECCP  programs  are  written  for  batch  processing.  Thus, 
an  attempt  ts  currently  being  made  to  solicit  both  conversational 
interactive  ,CAI  programs, and  pseudo- Interactive  programs  whose 
input  and  output  formats  are  compatible  with  remote  teletype 
terminal  operation. 
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Keyword  Index  to  Computet  Programs 

(listed  by  ^rogz'am  number;  entries  In  oapltal  letters 
are  program  names  rather  than  l^eyvords) 


ACETIC  AClD  -  6  • 
^  Acid,  weak  -  25 
Ammonia,  aqueous  -  2A 
ANISTROPY  -  13 


JIA:    OBJ  -  25 
Heat  capacity  -  11 
H202>  decomposition  of  -.23 
[H+]  -  10 


Base,  strong  -  25 

Benzene,  derivatives  -  14,  15 


INTERPLANAR'  -  A 
lONICON  -^  8 


Coefficients,  dlstrlbijftlon  -  3Q 
Coefficients,  equilibrium  -  30- 
Coefficients,  extinction  -  16 
Complexes,  spectra  -  32 
Constant,  equilibrium  -  7 
Constant,  Ionization  -  6,  8 
Constant,  stability  -  16 
Constant,  Van  der  Waals  -  21 
Correction,  calorlmetrlc  -  11 
Curve  fitting  -  20 


DELH  -  17 
Derivative,  method  r  1,  28 
DEkTI  -  1  ) 
Deviation,  percent  -  29 
DIPOLE  FIELD  -  12 
Dlprotlc  acid  -  8 
DJPR  -  30 
DSPAC  -  18 
DSPACE  -  DELSQ  -  3 
'D-Spaclngs  -  3,  A,  5,  18 


RDTA:  OBJ  -  26 
Electrode  potential  -  ^A 
Electronic  absorption  -'32 
Energy,  transition  -  32 
ENTHALPY  -  11 
EQUIC0N-F5SCN  -  7 
Equlllbrlum^onscant  -  7 
Extinction  coefficient  -  16 


Kinetics  -  2,  31  '  ' 

KSPAGAC  -  9 

Least  sqxiares,  curve  fitting  -  19,  20,  27 
.  Least  3quares,  errors  -  19 

Least  sqxiares,  linear  -  20      *  • 

Least  sqxiares  -  19,  20,  27. 

Least  sqxiares,  weighted  -  17  ' 

Llgand  field  -  32  , 
,  LINLEAST  -  19  • 

LSQ:  ^ORr-  27 


McWeeny  method  -  lA  ^ 
Miller  indices,  random  -  5 
MILLER  SPACINGS  -  5 
Molal  volume  -  22 
Monoprotlc  acid  -  8 
M+2  -  26  ^ 


Nemst,  eq.  ~  .2A 
NERNST:   'OBJ  -  2A 


OHCO  -  32 


Polynomials  -  27 
Position  number  -  12 


er|c 


Field  shift  -  12 


Guggenheim  method  -  23 
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RATE  -  23 

Rate  constants  -  2,  23,  31 
REGRESSION  -  20 
RING  CURRENT  (I)  -  14 
RING  CURRENT  (II)  -  15 
Rotating  bond  -  12 
Rotational  increment^ -  13 

Figure  1 
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.Abstract 

For  the  past  few  years  meuibers.  of  varioi^  departments  at  the  State 
University  of  New  York  at  Bin^amton  have  bien  ejq^loririg  the  applicability 
of  APL  (A  Prograjnming  Language)  td^instructional  use  in  the  undergraduate 
curriculum.    APL  is  a  particulasdly  attractive  language  for  computer- 
assisted  instruction  because:    1)  it  permits  direct  and  imme^ate  inter-  * 
action  between  thei  student  and  the  con5)uter;  2)  the  student  can  learn  to 
use  the  languiage  with  a  minimum  of  instruction;  and  3)  it  nrot  only  allows 
programming  in  the  conversational  mode,  but  also  has  considerable  confuting 
logical/  an4  plotting  c^abilities.    This  p^er  outlines  oicr  e^erience 
with  APL  in  the  chemistry  curriculum.    In  particular,  a  fairly  involved 
.program  used  by  our  students  in  the  General  Chemistry  Laboratory  e3q)eriment 
Molar  VoluMe  of  a  Gas,  is^  ^escribed\n  some  detail.    Included  is  a  dis- 
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Introduction: 

The  past  few  years  have  witnessed  a  greatly  increased  effort\b 
realize  sonie  of  the  educational  potential  presented  by  the  development 
of  conqputers*    A  jiuaber  of  different  systems  and  languages  have  been  tried 
in  coii5)ute?:-aided  instruction,  with  varying  degrees  of  success  and  promise. 
One  of  the  nost  promising  languages,  in  our  view,  is  APL  (1),  a  programming 
language  which  allows  direct  interaction  ^tween  the  user  and  th^  computer 
by  typewriter  communication. 

Recently  a  grant  from  the  IBM  Conpany^.permitted  an  exploration  of 
the  applicability  of  APL  to  instructional  use  on  the  college  level  at  our 
institution*  •  Since  then,  faculty  members  in  chemistry,  physics,  biology, 
geology,  mathematics,  philosophy,  and  music,  have  taken  advantage  of  this 
opportunity*    Although  frustfrated  at  times  with  the  amount  of  time  re- 
quired ^o  develop,  write,  and  de-bug  useful  programs,  those  involved  in 
this  exploration  have  for  the  most  part  remained  enthusiastic  about  the 
use  of  computers  in  undergraduate  teaching,  and  about  APL  as  a  useful 
language  for  cpn5)uter- assisted  instruction.    In  this  paper  s^me  of  our 
efforts  and  experiences  in  the  use  of  APL  in  chemistry  courses  at  the  State 
University  of  New  York  at  Binghamton  will  be  described. 

It^is  easy  to  be  enthusiasticinitially  about  uiing  computers  in 
teaching.    The  conputer  may  indeed  provide  a  new  dimension  ih  teaching, 
although ^care  should  be  Used  in  its  application  (2).    Furthermore  famil- 
iarity with  that  "mechanical  monster**  which  may  unwittingly  send  bill  after 
bill,  month  after  Bttnth,  despite  desparate  plea^  and  claims  of  paytaent, 
and  which' has  the  awesome  power  of  ruining  otherwise  perfect  credit  ratings, 
is  certainly  desirable  for  both  teacher  and  student  1    It  is  also  probably 

*  • 
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advantageous  for  studlhits  in  science  to  become  acquainted  with  the  com- 


putcr  and  its  applica:tions  at  an  early  stage  in  their  training. 


A  variety  of  program  types  for  instructional  use  has  been  suggested  (3) 
Ihese  include: 

4    (a)    programs  for^data  reduction  and  statistical  treatment  of  ex- 
Jjerimental  ^data;  ' 
0>y   the  simulated  lab  program,  vhich  allows  the  student  to  con- 

> 

.  /  centrate  on  the  interdependence  of  physical  variables  or  to 

examine  parameter  relationships  in  physical  models'; 
(c)    remedial  drill  programs  or  extensive  tutorial  programs  in  which 
the  coiq)uter  is  the  ic^ersonal  and  infinitely  patient  instructor, 
.  .  allowing  the  student  to  progress  without  embarrassment  and  at 

his  own  pace  through  particular  subject  material,  all  the  while 
wisely  questioning  the  student  to  test  his  understanding  of  the 
material.    This  last  type  is  undoubtedly  the  most  difficult 
kind  of  program  to  write.  ' 
But,  as  most  of  you  know,  it's  a  long  road  frdm  such  visions  to  useable 
programs  1 

API  Language: 

Even  before  defining  firm  objectives  for  efforts  in  CAI/ one  must  con- 
sider the  computing  system  available  and  the  choic^p  of  program  language. 
Ideally,  the  language  should  be:    1)  one  which  is  easy  to  program  and  easy 
to  use,  2)  one  which  allows  direct  and  immediate  communication  between  the 
user  and  the  computer,  3)  one  which  permits  extended  conversational  as  well 
as  computational  operations,  and  4)  one  which  accommodates  auxilliary  in- 
teractive facilities  such  as  slides,  tapes,  and  CRT  display.    Perhaps  APL, 


more  than  any  other  current  language,  comes  closest  to  this  ideal. 

APL,  A  Programming  Language  (1),  was  first  developed  and  used  at  IBM 
ifi  the  early  1960^5.    It  is  an  interactive^  algorithm-orien^d  time-sKaring 
language  with  a  single,  consistent  notation  for  many  areas  of  discourse. 
By  means  of  the  typewriter  terminal  the  user  can  define  and  store. programs 
and  execute  system  commands  or  mathematical  statements,  and  receive  an  al- 
most immediate  response  from  the  computer.    The  primitive  functioos  and  ^ 
Operations  in  APL  offer  great  flexibility  and  simplicity  in  mathematical 
and  logical  operations  -  particularly'' in  manipulation  of  vectors  and 
matrices.    Not  only  can  the  language  handle  fairly  sophisticated  calcu- 
lations  and  mathematical  operations,  it  is  also 'pre  lativ^ly  6asy  to  program* 
In  addition^  APL  has  a  graphic  display  capability  in  it&  plot  functions, 
and  allows  programming  in  the  extended  conversational  mode  so  that  instruc- 
tions may  be  given,  and  questions  asked>  in  alphabetical  statements  as  well 
as  in  mathematical  formulations.    Finally,  programs  can  be  easily  edited 
and  modified,  and  can  ^e  stored  in  public  libraries  for  ready  ac;.cess  by 
any  user*    No  class  time  is  needed  for  ccJmputer  instruction.    Brief  hand-, 
out  i.nstructions ,  si^plemented  by  individual  instruction  at  the  terminal^ 
'have  been  sufficient  to  allow  the  student  to  use  the  programs  which  have 
been  made  ayai'lable.    At  State  University  of  New  York  at  Birighamton,  those 
students  who  are  interested  can  attend  a  set  of  video-taped  lectures  on  APL 
which  are  shown  several  times  a  year.    Because  of  the  relative  simplicity 
of  the  language,  a  student  (or  faculty  member)  who  has  attended  only  a  few 
of  these  lectures  can  begin  to  write  his  own  programs. 

We  have  experimented  with  the  use  of  APL  in  the  Chemistry  Curriculum 
at  every  undergraduate  and  graduate  level.    Program  tjjges  inxlude  laboratory 

437 


, ,     '  .    ..  •  9-6-:  ^ 

simulation,  laboratory  data  reductjfbn, ,  and  statistical  treatment  o£  ex- 
perimental data  (43.    We  have  yet  to  complete  a  tutorial-type  program 
in  chemistry,  although  we  a^e  currently  Working  on  one  hn  the  mole  con- 
cept and  stoichiometry • 

Initially  we  were  opfejating  with  an  IBM  360/40  computer  with  a 


\ 

limited  number  of  teiminals,    API  was  available  only  three  hours  per  day; 

Consequently,  only  a  fairly  small  number  of  students  were  involved  in  the 

« 

use  of  our  programs.  (It  is  interesting^  to  no^e  that  the  terminals  were 
in  ^e  90%  of  the  tinve  when  APL  wa?  available,  compared  to  approximately 
10%  use  during  rpmote  FORTRANJiours.)^  Recently,  we  have  acquired  an  IBM  1 
360/67  computer,  and  APL  is  now  available  on  campus  eight  hours^  per  day. 
Currently  we  have  '24  remote  typewriter  terminals  (#2741)  on  campus  and 
seven  off-canpus.  ^lis  year,  for  the  first  time,  all^  of  the  students  in, 
general  chemistry  (over  300)  used  the  con^uter.  .  * 

General  Chemistry  Programs;  '  V 

Lab  Simulation;  >  Initially  students  in  one  of  the  second  semester 
freshman  chemistry  recitation  sections  worked  with  two  titration. programs 
developed  by  Science  Research  Associates  (Chicago,  Illinois),  and  somewhat 
modi,fied  by  lis.    These  programs  allow  the  students  to, simulate  ,apid*-base 
titrations  for  a  variety  of  weak  acids,  and^turbidimetric  titrations  to 
obtain  solubility  product  information^ for  a  number  of  slightly  soluble 
salts'.  •  '  '    ,  N 

The  program  TITRATE  randomly  ch(/oses  a  pK  within  2k  specified  range  / 
and  asks  the  student  to  specify  his  initial  titration  parameters  -  i.e., 

molar  conj;H^ration  of  both  acid  and  base  and  voltunexof  acid.    The  sttident 

1 

then  ad4s  aliquots  of  base,  and  the  computer  calciilates  the  pH  after  each  ^ 
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I  , 
addition.    If  an  error  is  made  during  the  course  of  the  titration  (ex.: 
too  few  points  in  the  region  of  the  endpoint)  a  reset  to  zero  conditions 
is  easily  made,  and  the  titration  is  repeated.    The  titration  record  is 
saved,  and  at  the  end  the  student  can  choose  to  see  a  graph  of  pH  vs, 
yol^fe  of  titrant  for  his  titration.    We  have  recently  added  options  for 
first  and  second  derivative  graphs,  expanded  in  the  region  of  the  endpoint. 
We  plan  to  expand  this  program  to  include  treatment  of  polyprotic  acids  and 
buffer  solutions. 

The  program  KSP  extends  the  student's  e;q)erience  with  equilibrium 
systems,  again  through  ^fl^ulated  titration  experiment^    In  this  case  the 
endpoint  is  indicated  by  "lasting  turbidity  upon  shaking",  and  the  KSP  is 
calculated  from  the  titration  data.  .  . 

Response  from  the  limite'd  number  of  students  involved  with  these  pro- 
grams  was  enthusiastic*    (In  fact  several  volunteers  were  coming  at  7:00  A.M. 
in  order  to  use  the  conputer!)    Although  this  type  of  program  does  no^t  pro- 
yide  experience  with  lab  technique,  it  can  help  the  student  grasp  the  basic; 
concepts  involved  in  titrimetric  analysis^  " 

\ 

Statistical  Treatment  of  Data; 

Later  in  the  year  a  s'et  of  simple  kinetics  programs  was  made  available 
to  introduce  the  students  to  statistical  treatment  of  experimental  data» 
The  student  supplies  the  data  (either  from  assigned  problems  or  from  lab- 
oratory experiment)  together  with  an  estimate  of  error  in  both  concentration 
and  time  values.    The  kinetics  programs  apply  simple  first  order  and  second 
order  treatment  to  the  data,  calculate  the  best  straight  line  fit  for  the 
data  by  a  sigiple  least  squares  treatment,  and  print  out  the  rate  constant 

•  \ 
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calculated  from  the  slope,  and  the  standard  deviation.    Again,  the  student 
can  choose  to  see  both  firs t\^d* second  order  graphs  of  the  data  and  can 
con5)are  the  rate  constant  value  from  the  least  squares  treatment  to  the 
one  he  has  obtained  from  his  ov^ ^h and^ drawn  gr^h'  of  the  same  data. 

An  additional  p-rogjram'  in  this  set  is  used  for  the  calculation  of  the 

activation  energy  from  the  temperature  dependence  of  the  rate  constant. 

* 

In  this  program  the  student  can  investigate  the  effect  of  error  limits  on 
the  activation  energy  and  standard  deviation.    Initially  the  computer  cal- 
culates the  best  fit  by  a  siii5)le  least  squares  method  and  prints  out  the 
activation  energy  from  the  slope  of  this  line  as  its  first  approximation. 
The  results  of  this  calculation  are  then  used  to  solve  cubic  equations 
from  a  more  sophisticated  line  fitting  treatment  by  a  series  of  successive 
approximations.    When  successive  values  differ  by  less  than  1%  the  calcu- 
lation  is  germinated  and  the_standard 'deviation  is  printed. 

Again,  only  a  small  number  of  students  these  programs,  and  we 

have  as  yet  no  valid  measurement  of  their  utility.    At  the  very  least  we. 
dan  say  that  these  istudents  were  introduced  early  to  one  of  the  inportant 
uses  of  the  computer  for  experimental  data  treatment,  and  to  the  fact  that 
experimental  results    should  always  include  some  indication  of  estimated  or 
calculated»experimental  error. 

Laboratory  Data  Evaluation  and  Reduction: 

Our  most  ambitious  program  at  the  freshman  level  is  one  which  analyzes 
and  stores  experimental  data  obtained  by  the  studeht  in  the  laboratory  de- 
termination of  the  molar  volume  of  oxygen.    The  purpose  of  this  laboratory 
experiment  is  to  teach  the  student  quantitative  techniques  in  working  with 
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gases  and  to  verify  Avogadro's  hypothesis  concerning  gas  behavior*  The 
student  weighs  out  a  sample  of  a  mixture  of  potassium  chloride  (XCl)  and 
potassium  chloracte  (KCIO3)  (%  Unknown)  and,  with  appropriate  apparatus 
which  he  has.  assembled,  decomposes  the  KCIO3  by  heating,' and  collects  and 
measures  the  o^gen  gas  evolved.    From  the  primary  data  <:ollected  (weights, 
volume  of  water  displaced,  temperature,  barometer  reading)  the  student  can 
then  calculate  a  molar  volume  for  oxygen  at  standard  temperature  and  pressure 
and  the  %  KCIO3  in  his  sample, 

Ihe  Q:q)eriiaental  procedure  is  a  fairly  demanding  one  for  students  at 
this  level,  and  the.re  is  plenty  of  opportunity  for  experimental  error.  Six 
pieces  of  primary  data  mu§t  be  obtained.    On  these  the  student  performs  a 
number  »of  calculations,  including  aqueous  vapor  pi^^ure  and  barometer 
scale  e:q)ansion  corrections,  before  he  arrives  at  his  rinAl  answer.  Typically, 
a  student  would  carry  out  one  determinaXion^ai\d  j.ust  finish  ttis  calculations, 
by  the  end  of  the  lab  period,  only  to  find  that  ni^JPinal*  answer  is  un- * 

♦ 

acceptable  and  he  must  repeat  the  experiment.    Clearly,  it  would  be  help- 
ful to  the  student  to  hkve  an  immediate  check  on  the  accuracy  of  his  primary 
data.  ,  . 

The  objectives^  o^  the  computer  programs  in  the  CHEMLAB  workspace  are: 

1)  To  allow'  the  student  to  check  immediately  the  accuracy  of  his  ^ 
experimental  measurement  of  the  volume  of  oxygen; 

2)  To  allow  the  student  to  check  his  calculations  of  molar  volume 
aftd  %  composition  for  .his  particular  sample; 

3)  .  To- store  student  primary  data  and  results  for  subsequent  analysis 

and  grading. 


The  initial  iii5)etus  fpr  this  program  come  from  a  paper  preseivted 
at  the  Synposium  on  the  Status  of  Conputer-Assisted  Instruction  in  Chem- 
istry, held  at  Toronto  in  May,  1970  (S).    In  this  paper,  D.S.  Olson  and 
colleagues    described  the  use  of  conputer  pvindi  cards  for  grading  diemistry 
laboratory  reports  for  large  classes  at  the  Air  Force  Academy^    We  realized 
that    by  using  an  interactive  language  siich  as  APL,  and  by  placing  com- 

s 

puter  terminals  in  the  laboratgry,  we  could  not  only  ease  the  grading 
^burden  for  our  teadiing  assistants  and  collect  data  for  class  analysis, 
but,  more  iir5>ortantly,  could  also  permit  the  student  to  distinguish  between 
experimental  eiror  and  calculation  error  in  data  treatment  while  his  apparatus 
was  still  available  for  repeats.  •  , 

As  soon  as  the  student  has  finished  collecting  data  for  his  first  run, 
he  goes  to  the  eoitputer  terminal  in  the  lab  and  types  CHEMLABl,  which  starts 
function  execution.    (Just  prior  to  the  laboratory  period  the  lab  instructor 
has  signed  onto  the  computer  and  loaded  a  copy  of  the  oH^AB  workspace  from 
the  publi):  library  into  his  active  workspace  by  a  sinple  system  command. 
Three  additional  system  commands  initialize  the  program  so  it  is  re^dy  for 
use  ty  students  in  liis  lab  section.)    This  fun.ction  has  three  main  parts. 
Access  to  each  part  is  detemti^ned,  by  a  sequencing  code  in  the  program.  The 
first  part  provides  for  entry  of  primary  experimental  data.    The  student, 
is  instructed"^ to  enter  his  name,  sample  designation,  weights,  temperature, 
barometer  reading,  and  collected  volume  of  oxygen.    The  program  then  cal- 
culates the ^theoretical  volume  of  oxygen  (using  student's  conditions  of 
ten5)erature  and  pressure,  and  using  %  composition  information  stored  in  pro- 
•gran  for  the  designated  sample)  and  compares  it  with  the  student's  experimental' 
value.*  Ttie  con^uter  response  is  determined  by  the  agreement  between  these  ' 

*    (See  appendix  for  sample  execution* of  CHEMLAB,) 


values;    if  the  student's  data  3,s  good/>;^  is"  told  to  proceed  with  his 

calculations.    His  n^ame  and  .data  ^re  stored  in  .the  workspace  and  the' se- 

quencirig  code  is  advanc|d,  so  that 'the  next  time  the.  same  studenst  calls 

*      '•    ^.  •  '     ,  .  » 

for  the  program,  it  ctutomatic^Uy  branches  to  Part  2.'  If  the,  agreement  is 
fairly  good,  the  student  is  allowed*  to^roceed  with  his  calculations,  but  , 
•is  advised  to  repeat  the  experiments    If  the  agreement  is  pooi^,  then  the 
student  is  told  that  his  data  is  unacceptable  and  fhat  he  must  repeat. the.. 
^  expe^riment.    In  the  case  of  fair  Or  poor  results",  leading, questions- are 
asked  by  the  conqputer  to  help  the  student  pinpoi^^t  likely  sources  of  error. 
Once  his  data  is  Accepted  by  the  computer,  the. student  can  then,  from 
this  primary  data,  calculate  the  weight  of  oxygen,  the  partial  pressure  of. 
oxygen,  aijd  the  molar  volume  of  oxygen  at  STP.'  Part  2  of  biEMLABl  requires 
the  student  to  enter  his  calculated  values.    The  program  then  does' the  same 
calculations,  using  the  student's  primary  data  which  is  stored  in  the  DATA 
matrix  and  the  carrection  factors  stored  i^n  the  program.    The  cqng)uter  also 
calculates  the  theoretical  values  for  the    particular  san5)le  which  the 
student  h»as  analyzed.    The  three  columns  of  data  are  then  printed  otit  side 
by  side  for  con^arison.*  This  allows  the  student 'to  check  immediately  a)  th 
validity  of  his  calculations,  and  b)  the  accuracy  of  his  experimental  mea- 
surements.   The  student  includes  this  part  of  the  computer  printout  in  his 
lab  report,  and  is  e;q)ected  to  discuss  the  agreement  or  disagreement  of  the 
three  columns. 

Again,  if  the  student's  data  is  acceptable  he  may  proceed  to  ?art  3, 
From  the  experimental  weight  of  oxygen,  the  known  molecular  weight  of  O2, 
the  total  sample  weight,  and  the  balanced  equation  foV  the  reaction,  he  can 
calculate  the  percent  potassium  chlorate  in  his  sample.    He  then  enters  the 

*    See  appendix.  - 
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result|  of  this  calculation  in  Part  3.    The  program  does  the  same  cal- 
culations  and  prints  out  the  two  columns  of  data,  along  with  the  theore- 
tical %  KCIO3  for  the  student*s  san5)le.    Again,  the  student  can  check 
both  the  validity  of  his  calculations  and  the  accuracy  of  liis  results. 

Once  the  student's  data  is  accepted  and  saved  by  the'  coiiputer,  it 
is  stored  in' the  workspace  and  cannot  be  altered.    The  program  LABINSTRUCT 
permits  the  laboratory  instructor  to  retrieve  all  of  the  names  and  data 
from  his  lab  section  in  a  defined  format  for  coiq>arison,  analysis,  and  . 
-grading. 

This  program  was  initially  written-in  the  summer  of  1970  and  first  ' 
used  by  a  small  class  in  that  summer  session  bf  general  chemistry.  .The 
experience  *with  the  summer  group  indicated  that  the  program  was  useful  and 
led  to  fairly  extensive  modifications  and  expansion  of,  the  progra^.  The 
revised  program  was  used  by  over  300  students  in  the  fall  semester  of  1970. 
Although  a  few  students  regarded  the  use  of  \he  computer  as  an  extra  chore, 
the  general  response  of  the  students  can  b^  described  only  as  enthusiastic 
and  excited.    Students  'suggested  further  modifications  of  the  program  tb 
make  it  moj^  helpful.    This  led  (0  further  revisions  which  were  incorporated 
for  use  in  the  spring  semester.    Students  al'so  suggested  that  similar  pro- 
grams'  for  other  experiments  would  be  very  helpful. 

The  program  also  rece^ived  strong  support  from  the  lab  instructors. 
•Usually,  for  this  experiment,  if  the  final  calculated  results  are  in  error, 
the  instructor  must   refigure      each  calculation  until  thd  error  is  discovered, 
it  is  therefore  quite  a  chore  to  evaiudle  constructively  and  conscientiously 
-  40-50  lab  reports.    The  computer  program  eliminates  all  of  the  "busy,  work'^* 
for  the  , instructor  and  allows  him  to  pinpoint  the  source  of  error  very  readily. 


1    •      •  ^  ■  ■ 

I  .  .      ■    '     ,-  . 

In  many^  cases,  because  the  studgjit  can  also  check  his  own  errors,  he  can 

discuss  his  e^erimental  results  more  i^tel^igently. 

Finally,  ready  access  to  primary  data  for  the  entire  large  class  ^ 

permitted  us  to  examine  closely  the  results  of  the  experiment  and  to  set 

f 

reasonable  limits  of' acceptability. 

The  CHEMLAB*  program  has  been  described  in  some  detail  because  it  is 
our  most  successful  program  to  date.    Vfe  are  encouraged  by  its  receptipn 
and  are  convinced  that  this  is  a  useful  way  to  use  the  computer  in  the 
laboratory.    The  progi>afra.  continues  to  develop  with  eadi  class.  Future 
intended  modifications  iiiclude  tutorial-type  questions  to  guide  the  student 
into  more  thoughtful*  analysis  of  the  experiment. 

Advanced  Courses; 

APL  programs  I^aye  also  been  used  in  some  of  the  advanced  courses  in 
the  chemistry  curriculum.    For  example,  in  the  organic  laboratory  a  pro- 
gram is  used  to  calculate  the  percent  composition  of  liquid  mixtures  from 
vapor  phase  chromatography  data.    In  the  introductory  physical  chemistry 
course  a  program  has  been  written  (by  a  student)  which  calculates  the  eigen- 
values and  wavefunctions  foi^  a  one-sided  potential, well.    By  varying  the 
parameters  (depth  and  width  of  well 'and  mass  of  particle)  the  student  can 
follow  the  effect  of  parameter  variations  on  the  eigenvalues  and  attempt  td" 
discover  validL  relationships. 

The  program  for  determination  of  activation  energy,  mentioned  earfier, 
was  actually  developed  for  u^e  by  juniors  and  seniors  in  a  physical  chemistry 
lab  experiment  on  the  internal  rotation  of  amides,  measured  by  a,  nuclear 
magnetic  resonance  method.    The  data  olitained  are  not  susceptible  to  simple 
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least  squares  treatment,  so  the  students  use  a  computer  program  to  cal- 
cula^  weighting  functions  for  statistical  treatment  of  their  NMR  data  (6). 
/       Finally,  in  the  physical  organic  courses,  students  have  been  required 
to  develop  their  own  line-fitting  program  in  APL  for  treatment  of  kinetics 
and  oth^r  experimental  data. 

Other  Departments ; 

Several  other  departments  at  State  University  of  New  York  at  Binghamton 

have  also  been  experimenting  with  APL  in  their  courses.    In  the  genetics 

and  biostatistics  courses  fairly  simple  programs  have  proven  to  be  both 

fascinating  and  useful  to  the  student  (7).  '  The  program  RANDOMDRIFT  is  an 

example  of  a  conqputer-simulated  experiment  in  genetics  which  could  not  ^ 

possibly  be  done  by  the  students  in  the  laborlatory.    If  a  population  is 

limited  in  size,  or  if  only  a  small  fraction  of  the  population  is  responsibl 

for  the  offspring  6f  the  ne^t  generation,  the  genetic  conqposition  of  the 

population  will  fluctuate  in  a  random  manner.    The  smaller  the  population, 

the  greater  the  random  fluctuation.    This  is  known  as  the  Sew aJIr Wright 

Effect.    Instead  of  merely  bein^  informed  of  this  experimental  observation, 

the  student  is  permitted  to  experience  the  sensation  of  discovery  by  using 

an  APL  function  which  is  programmed  so  that  half  of  the  population  in  each 

generation  is  randomly  selected  to  be  parents  for  the  next  generation.  The 

student  starts  with  four  allelic  gene's,  selects  the  size  of  the  population 

sample  and  continues  the  reproductive  process  until  three  of  the  four  loci 

are  lost  from  the  population  by  random  drift. 

Extended  tutorial  programs  which  simulate  physical  laws  otherwise  dif- 
l  '  1 

ficult  for  the  student  to  investigate  have  beeh  most  successful  in  the 
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physics  department  C8) .    Prior  to  the  simulated  exjJferiment  a  series  of 

.  questions  in  multiple  choice  format  familiarizes  the  student  with  the  prob- 

*  lem  and  provides  a  guide  to  intelligent  selection  of  parameters •  Wrong 

r 

choices  lead  to  appropriate  branches  in  the  program  which  contain  addi- 
tional drill  and  guidance.    The  student  specifies  all  important  para- 
meters for  the  experiment,  and  observes  the  results,  usually  in  graphical 
form.    Additional  tutorial  questions  guide  the  student's  analysis  of  the 
results  and  help  him  discpver  the  parameter  relationships.    For  example, 
the  program  ORBI'T plots  the^. orbit  of  a  satellite  launched  tangentially 
from  a  central  body.    The  student  specifies  the  central  body  and  its  mass, 
the  satellite  mass,  and  initial  distance  and  itelocity.    He  then  observes 
the  effects,  of  va^injg  these  parameters  on  the  orbit  of  the  satellite.  Other 
tutorials  in  physics  include  an  investigation  of  wave  behavior  and  inter- 
ference patterns,  and  analysis  of  Michelson-Morley  type  experiments  to 
arrive  at  postulates  of  relativity.    Indeed,  these  programs  have  been  so 
successful  that  the  physics  department  this  year  has  rejflaced  the  intro- 
ductory physics  laboratory  with  a  "structured  combination  of  computer  lab- 
oratory and  hands-on  demonstrations".    The  traditional  introductory  lab  has 
been  upgraded  and  ihoved  to  the  sophomore  year.  * 

Conclusion: 

.   Although  our  experience  so  far  is  insufficient  to  make  extensive 
judgements  about  the  educational  yalue  of  our  efforts  in  computer- ass is ted 
instruction,  there  are  a  nunber  of  very  encouraging  aspects  of  our  e^loration 
to  date: 

1)    We  have  developed  some  programs  which  appear  to^be  useful  and 
unique  as  instructional  aide  ajid  whi<iiK,  can  be  successfully  inte- 
grated into  our  cdurse  programs.  ,The  more  workable  model  programs 
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\  *     we  have  on  hand,  the  easier  it  is  to  formulate  and  develop 
additional  programs • 

2)  We  are  succeeding  in  introducing  a  significant  number  of  students 
to  the  coii5)uter  and  Its  potential  uses,  and  they  are  responding 
enthusi  as  ti  cally . 

3)  A  significant  nulnber  of  students,  on  their  own  initiative,  are 
writing  their  own  programs  in  APL  and  continuing  to  use  the  com- 
'puter  for  course  work,  lab  .data  treatment,  and  research  data 
toalysis* 

These  are  the  encouraging  factors*    The  discouraging^  ones  are  the  cost 
in  b^h  time  and  money  to  support  and  develop  such  a  program,  .These  cost 
factors  are  difficult  to  determine,  since  tiiese  programs  ui\dergo  almost 
continual  evolution  eVen  after  considerable  use.    Recent  conservative  ds- 
timates  at  our  institution  C8)  indicate  that  the  development  of  a  long 
tutorial  like  ORBIT,  or  an  involved  program  like  CHEMLAB,  requires  approxi- 
mately 100-120  hours  of  full  time  effort,  .Thus  the  cost  for  author's  time 
would  be  in  the  range  of  $1000  for  such  a  program,  .The  only  other  cost  to 
the  department  in  the  development  of  the  program  is  CPU  time,  at  $100/hpur. 
In  most  cases  this  is  small  conpared  to.  the  cost  of  the  author's  tinje. 

The  major  costs  in  a  CAI  program  are  tei^ninal  rental,  ^t  roughly  $1500 
per  year  per  terminal,  and  student-generated  CPU  time,    A  typical  calcu- 
lation (assuming  five  terminals,  an  average  student  connect  time  of  an  • 
]^our  a  week,  and  an  average  20:1  ratio  of  connect  time  to  CPU  time)  fields 
an  estimated  cost  of  $20,000  per  year  for  a  class  of  100  students,  or 
roughly  $200  per  studfent,    pbviously,  if  the  cost  remains  at  this  level, 
one  must  decide  whether  it  is  more  useful  to  have  the  computer  available,' 
or  to  hire  another  faculty  member  (or  two)  I  '  * 
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At  present  however,  given  the  necessary  funds,  a  willing  and  able 
computer  center  staff  and  the  continued  interest  of  both  faculty  and 
students,  we  plan  tJ&  continue  to  worH  on  the  development  of  programs  for 
computer-assisted  instruction.  '  ^  ^ 
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^  Appendix 

\ 

Title 

Nuiriber   

1  Beginning  portion  of  APL- program,  CHEMLABl, 

2  Portion  of  APL  program,  CHEMLABl, 

3"  Sample  execution  of  CHEMLABl,  part  1. 

4  Sample  execution  of  CHEMLABl,  part  2. 

5  Sdjotple  execution  of  CHEMLABl,  part  3. 

6  Sample  printout  of  stored  ^data  from  CHEMLABl, 
by  program  INSTRUCTOR, 
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Computer-Generated  ^peatable  Tests 
in  Chemistry 

by  Franklin  Prosser  and  John  V.  Moore 
0  Indiana  University,  ^loomington 

The  specification  to  students  of  performance  (behavioral), 
objectives,  as  has  teen  pointed  out  by  Jay  Young  ,  is  more  cownonly 
(and- emphatically)  done  by  means  of  examinations,  .Examinations 
should  involve  other  aspects  (self -evaluation,  requirement  of 
'additional  study,  opport.urlty  for  original  thought  in  applying 
principles  to  real  situations)  Iwhich  are  important  in  inducing  / 
students  to  learn,    In  the  laxge-scale  classes  common  in  many  I 
universities  (and  to  a  majority  of  students  in  general  chemistry) 
such  aspects  of  testing  have  been  subordinated  to  the  necessity  of 
evaluating  a  student's  progress  and  comparing  him  with  others. 
This  has  come  about  as  a  result  of  lack  of  time  and- energy  on  the 
part  of  instructors  in  large  courses,  and  because  of  a  failure  to 
rec'ognizrf  that  such  courses  differ  qualitatively^fis  well  as  quan- 
titative^ from  the  clacsses  of  thirty  to  fifty  taught  in  the  "good  ^ 
old  days,^    The  problem  is,  of  course,  readily,  apparent  to  studehts, 
who  often  fail  to  find  anything  worthwhile  in  our  systems  for  testr 
ing,  and  therefore  call' for  the  abolition  of  all  grades.    If  the 
only  function  of  our  examinations  ife  to  order  students  (to  a 
precision  of  four  significant  figures)  from  best  to  worst,  such 
calls  probably  ought  to  be  heeded.  ^ 

In  this  paper  we  present  a  system  of  testing  which  allows  the 
instructor  to  overcome  many  of  the  drawbacks  of  conventional 
testing  without  signiflciant  increases  in  time,  energy  or  money  being 
spent,  provided  the  system  is  used  for  a  four  year  period"]    (as  with 
any  innovation,  a  certain  activation  energy  must  be  overcome  ini- 
tially, but  once  begun,  the  system  appears  to  have  a  negative  free 
energy  change,)    Although" not  a  form  of  Computer-Assisted 
Instruction  (CAl)  in  the  defined  sense  of  the  term  ,  this  system 
is  a  form  of  computer-managed  (or  computer  facilitated)  instruc- 
tion-^.    For  large-class  instruction  a  computer  is  required  to 
handle  clerical  tasks,  but  there  is  no  on-line  interaction 
between  student  and  computer.    This  means,  of  course,  thai  the 
technological  and  economic  disadvantages  of  CAI  are  elirtinated. 
It  also  means  that  the  option  of  adopting  the  system  lies  with  the 
individual  instructor,  rather  than  the  university  or  a  federal 
agency.    The  necessary  computer  hardware  is  available  on  most 
campuses  sind  ^  can  provide  appropriate  software  to  those  who  need 
it.    As  will  be  demonstrated  later  in  this  paper,  many  of  the" 
advantages  of  CAT  are  inherent  in  our  system  as  well, 

V 

The  system  to  be  described  consists  of  computer-generated 
repen.tab]e  tests  (CGRT),    Large  numbers  of  unique  but  equivalent 
tests  are  g'6nerated  by  a  computer  program  which  takes  stratified 
random  samples  from  an  item  pool  and  prints  out  .questions  and 
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answers  in  a  format  appropriate  to  test  taking  and  machine  or 
hand  grading.    Using  this  system  students  may  be  examined  more 
frequently,  encoura^ng  them  tn  k^^ep  current  and  allowing  a 
better  opportunity  for  self-pvaluation.    Students  get  immediate 
(within  forty  five  minutes)  feedback  since  answer  keys  (jpcluriing 
suggestions  for  remedial  study)  are  provided  as  soon  as  the  test 
is  over,    Most  importantly,  the  exams,  because  they  are  unique, 
may  be  given  repeatably,  '  This  allows  the  student  to  take\^  test,- 
discover  some  of  the  performance  objectives  he  had  not  tttohght  of 
before, study  or  review  to  increase  his  mastery,  and  retake  the 
test,    These  exams  have  real  pedeigogical  value,  and  much  of  the 
trauma  associated  with  testing  large  classes  has  been  eliminated,  ^ 

The.Co?iputer  Generated  Repeatable  Testing  process  typically 
consists  of  four^stepsi    (l)' developing  pools  of  test  items,  (z) 
producing. tests,  (3)  administering  the  tests,  and  (4)  scoring  the  ^ 
tests,    The  second  and  fourth  steps  are  managed  by  computer,  while 
the  execution  of  the  first  and  third  steps  is  strongly  influenced 
by  the  computerized  nature  of  the  process. 

For  each  exam,  the  course  instructor  develops  a  poo]  of  "^tems 
(test  questions)  which  forms  the  data  base  from  which, tests  are 
•nrepared.    This  is  ?  rather  formidable  step.    Our  experience 
indicates  that  one  should  have  at  least  six  to  ten  items  in  the 
pool  for  each  question  on  an  exam  to  assure  adequate  variation  on 
the  individual  tests.    An  instructor  planning  to  give  eight  exam? 
of  twent^'  questions  each  should  construct*  about  "'f'ifteen  hundred 
individual  items  for  his  cr^urse.    This  work,  although  it  has  the 
advantage  of  being  jaore  familiar  to  most  instructors  than  progt*am- 
ming  CAI,  Is  every  bit  as  tedious  and  tim**  consuming  as  it  sounds. 
It  should  be  done  prior  to  the  first  semester  in  which  repeatable 
testing  is  to  be  used  in  the  course,    Fortunately,  the  itemi^oolf?, 
once  developed,  are  rather  permanent,  especially  for  the  basic 
coll%e  under^J^raduate  courses  that  are  the  most  ]ikely  candidates 
for  this  computerized  testing  scheme.    Only  relatively  minor  alter- 
ations to  the^  item  pools  are  needed  to  accommodate  other  instructors, 
chamges  of  texts,  etc,,  that  may  ocnur  in  subsequent  semesters. 
Further,  textbook  publishers  often  have  compendiums  of  test  ques- 
tions for  their  popular  texts,    Reusing  test  questions  semest'^r 
after  semester,  or  even  making  the  entire  item  pool  available  to 
students,  is  not  a  disadvantage  under  our  procedure;  and  i'^  fa^+ 
is  "likely  to       dl*^tinctly  advantnge^us ,  sinoe  it  is  merely  a 
specification  of  behavioral  r^bj^ctives. 

Since  items  will  ultimately  appear  on  comput«»r  /oj-ener^^ted 
tests,  the  form  of  the  Hems  must  conform  to  the  requi7^ment<?  of 
present  computer  printing  techno! o/^,    Normally,  items  may  con- 
sist of  upper-case  letters,  numb*^^,  and  the  usual  smcia^ 
phpT^'^+orf?  ^»vaMable  on  modern  hlf^h-speed  line  printers  DiarrpTn^, 
pictures  and  other  graphic  aids  cannot  usually  t>e  printed  directly, 
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although  the  instructor  may  easily  include  these  by  providing  the 
student  with  k  supplementary  sheet  of  diagrams  to  accompaJiy  the 
tests.    Our  experience  has  been  that  such  "handouts"  are  quite 
adequate  for  presentation  of  structural  formulas  and  other  chemical 
information  not  readily  printed, 

If  the  test's  axe  to  be  graded  manually,  technology  imposes  no 
limitation  on  the  structure  of  the  ajiswer  to  aji  item.    The  test 
questions  may  elicit  objective  or  subjective  relsponses  from  the 
student.    On  the  other  hand,  if  the  instructor  wishes  to  use 
V mechanical  grading  techniques,  he  must  provide  for  a  single** 
character  response  for  pach  ■'tern,  becaur.e  of  restrictions  imposed 
by  the  optical  mark  sense  form  readers  usually  available  in 
universities.    While  this  requirement  may  appear  to  be  a  severe 
limitation,  it  in  fact  allows  considerable  freedom  in  the  form  of 
objective  test  items.    True-false  and  multiple  chojoe  items  call 
for  single  character  responses.    Key-word,  fill-in,  and  other 
forms  resulting  In  a  definite  numeric  or  symbolic  answer  may 
easily  be  rejiuced  to  a  single  character  response  using  the 
following  convention  I    In  such  a  question  the  form  of  the  answer 
is  indicated  by  a  series  of  dots  which  includes  one  asterisk.  The 
s-tudent  will  construct  his  symbolic  or  numeric  answer  to  the 
question,  and  will  record  as  his  response  on  his  mark  sense  form 
the  single  character  selected  by  the  position  of  the  asterisk  in 
the  string  of  dots.    For  example,  means  cQde  the  third 

letter  or  digit  of  the  answer,  means  the  first  letter  or 

digit,  and  so  forth,    Studen^ts  describe  such  alphabetically  or 
numerically  coded  items  as  being  hard  but  fair.    The  student 
cannot  answer  snch  an  item  unless, he  has  mastered  the  basic  con- 
cepts and  vocabulary.    Recall  is  emphasized;  simple  recognition  Is 
subordinated. 

In  addition  to  the  question  part  of  an  item,  which  the 
student  sees  when  he  takes  a  test,  each  item  also  has  an  answer 
part,  to  ^llow  machine  grading  and  to  provide  information  to  the 
student  after  testing.    The  answer  part  of  an  item  may  contain,  in 
addition  to  an  answer  character,  any  relevant  information,  such  as 
the  full  symbolic  or  numeric  answer,  textbook  page  references,  and 
other  diagnostic  aids  for  the  student. 

After  the  instructor  has  developed  a  section  of  his  test  item 
pool,  he  will  have  it  p\uiched  onto  punch  cards  or  entered  into  an 
appropriate  editable  data  file  system.    To  facilitate  the  selecion 
of  items  fpr  an  individu;^]  test  and  to  maintain  order  among  the 
large  item  pools  the  instructo^^  classifies  his  items  into  sets, 
the  items  within  a  set  are  given  distinct  un"^t  numbers,  and  cards 
or  lines  for  the  question  part  and  the  answer  part  of  each  item 
are  numbered  serially.    Usually  a  set  will  consist  of  those  items 
that  te^t  similar  materia^. 
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The  individualized  tests  are  generated  on  a  digital  com|)ut'er 
using  a  computer  program  GENERATpR,    T^s  program  reads  the  Item^ 
pool  for  a  particular  exam,  checks  the  input  data  for  proper 
sequencing  atnd  correct  format,  reads  information  describing  the 
tests  to  be  generated  (number  of  tests,  number  of  questions  per 
test,  et6j,  generates  and  prints  the  individual  test,  and  punches 
a  small  ajiswer  summary  deck  for  use  in  mechanized  grading.  Each 
t^st  is  individually  numbered  aind  has  questions  on  the  left  part 

-  of  the  line  printer  page  and  ainswers  on  the  right.    The  item 
identification  numbers  for  each  question  appear  in  the  answer  part 

-  for  reference.    The  instructor  will  of  course  separate  the  answer 
part  from  the  question  pairt  prior  to  giving  a  test  to  the  student. 

The  computer  program  selects  items  for^a  test  by  randqmly 
choosing  an • item  from  each  set.    The  order  of  choosing  sets  is 
also  randomized.    No  item  is  used  ^ore  than  once  per  test.  ^  The 
digital  computer  is  vital  to  test  productions,  since  the  random 
item  selection,  formatting,  and  printing  of  large  numbers  of 
individualized  tests  is  beyond  the  capacity  of  nonautomated  oper- 
ations.   The  instructor  may  alsd  assign  weights  (point  values)  to 
sets  of  items,  thus  allowing  hit  to  emphasize  particular  topics^ 
or  award  points  abased  on  the  -difficulty  of  items.  ! 

The  computer  time  required  to  generate  the  tests  is  very 
small?  the  time  required  to  print  tests  is,  however,  substantial. 
Typical  times  on  the  Indiana  University  CDC  36OO  computer  system 
are  about  four  minutes  of  computer  time  (of  which  about  20  seconds 
are  for  item  selection)  to  generate  1,000  three-page  tests,  and 
about  three  hours  of  printer  time  to  print  them.    As  we  show  later, 
the  total  cost  per  test  is  about  5  cents. 

P 

The  instructor  decides  for  himself  how  emd  whe^i  to  test  his 
students.    He  may  give  tests  in  class  or  at  other  scheduled  times; 
or,  more  flexibly,  he  may  allow  his  stiidents  to  choose  their  own 
times  for  testing.    A  combination  of  in-class  testing  preceded  or 
followed  by  opportunities  for  student- scheduled  testing  appears 
to  be  useful.    Such  options  depend  on  the  instructor's  preference 
and  the  availability  of  testing  rtfom  space  and  personnel, 

A  student  taking  a  test  usually  obtains  an  individualized 
test  (with  answer  part  removed)  and  a  mark  sense  form  and  specia] 
pencil.    He  takes  a  seat  in  the  testing  area  and  immediately  enters 
on  his  mark  sense  from  his  student  identification  number  (social 
security  number  or  other  agreed-upon  identifier),  the  exam  number, 
and  his  individual  test  number.    The  student  then  mrks  his  an3wer'^ 
on  his  test,  and  for  each  question  enters  the  appropriate  single- 
character  response  on  his  mark  sense  form>^  After  completing  a 
test  the  student  exchanges  his  mark  s^ense  form  for  the  answer 

*  *  '*  i 
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part  of  his  Individual  test.    The  mark  sense  form  is  kept  \>j  the 
proctor  for  later  grading.    The  student,  having  the  correct  answers 
in  hand,  caji  inuiediately  determine  his  errors,  and  is  stimulated 
to  improve  his  knowledge  of  weak  areas. 

The  instructor  or  his  assistant  will,  whenever  convenient^ 
have  the  information  on  the  mark  sense  forms  transformed  to  pimch 
'cards  on  an  optical  marie  sense  form  reader.    This  step  is  required 
to  obtain  a  form  of  input  acceptable  to  the  typical  academic  compute- 
ing  facilitityi  one  can  bjrpass  this  step  if  optical  mark  sense  fbrm 
reading  equipment  is  attached  directly  to  his  institution's  computing 
equljment. 

Scoring  of  the  student  responses  for  an  exam  is  done  by 
computer  using  program  GRADER,    Input  to  this  program  is  the 
answer  summary  deck  punched  by  program  GENERATOR  when  the  tests 
were  prepared,  and  the  student  response  cards  derived  from  the  mark 
sense  forms.    Output  of  this  program  is  a  roster  of  student  ID's 
and  test  scores  and  a  pimch  card  deck  of  the  high  scoire  for  each 
student  for  this  exaun. 

As  a  followup  of  test  scoring,  we  are  developing  an  item 
analysis  procedure  for  GCRT,    Since  the  item  pools  tend  to  be 
reused  many  tines,  such  an  item  analysis  will  aid  in  the  detection 
of  defective  test  items  and  will  assist  the  instructor  in  polishing 
his  item  pool. 

The  test  producing  program  GENERATOR  and  the  scoring  program 
GRADER  are  written  in  Fortran,    Virtually  all  acedemic  computing 
centers  have  well-maintained  Foirtran  compilers  that  produce  a 
fairly  good  quality  object  code.    We  have  several  versions  of  the 
CGRT  programs!    well-Kiocumented  ANSI  Fortran  versions  designed  to 
run  on  all  commonly-available  computers,  and  specialized  versions 
of  GENERATOR  for  the  CDC  36OO  and  for  the  CDC  66OO,    The  special- 
ized versions  utilize  CDC  extensions  of  ANSI  Fortran  to  decrease 
the  execution  time  dramatically  by  by-passing  the  repetitive 
processing  of  format  statements  during  test  printing.  Since 
GENERATOR  is  completely  output-bound,  we  anticipate  that  many 
potential  users  of  the  ANSI  Fortrain  version  would  wish  to  discuss 
modification  of  the  program  with  their  systems  people  to  take 
advantage  of  local  extensions  to  Fortran  output  facilities. 

At  first  glance  a  procedure  that  uses  a  computer  for  test 
preparation  and  for  printing  of  individualized  tests  appears 
economically  unsound.    This  is  very  definitely  not  the  case.  In 
Tables  I  jand  II,  which  are  cost  analyses  for  the  preparation, 
printing,  and  scoring  of  1000  three-page  test?,  we  have  attempted 
to  itemize  expenses  in  a  similar  manner  for  both  CGRT  and  the 
conventional  method.    Therefore,  the  cost  of  a  computer  line 
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printer  and  associate^  equipment  has  been  treated  as  a  sepsurate 
entry,  rather  than  included  in  general  computer  charges.    We  haVe  . 
assumed  that  such  expenses  as  the  initial  keypunching  of  item  pools 
are  distributed . over  four  semesters. 

The  analyses  show  that  both  CGRT  and  conventionally  prepared  ^  ^ 
tests  cost  about  5  cents  per >test  (exclusive  of  the  instructor's 
time  required  to  generate  questions).    While  the  estimate  for  con- 
ventional ex^s  is  fairly  accurate,  changing  some  of  the  assump- 
tions in  the  CGRT  analysA<s^y  alter  the  esti^iate  by  perhaps  up  to 
two  cents  per  test.    A^o,  under  repea table  ^testing,  students  tend 
to  take  more  than  one  repeatable  test  over  each  examination  unit. 
In  any  event,  the  cost  of  repeatable  tests  is  in  the  same  range  as 
conventional  tests.    More  importa^it,  the  expenses  of  the  CGRT  process 
are  a  very  minor  item  in  the  cost  of  educating  the  student, 
amounting  to  $.50  to  $1.50  per  student  per  course.    This  is  inex- 
pensive education! 
/ 

The  results  of  an  attitudinal  survey  of  about  four  hundred 
students  in  chemistry  CIO5  at^ Indiana  University  during  the  fall 
semester,  1970,  are  reported  as  percentages  in  Table  III.  The 
survey  was  administered  shortly  after  the  Christmas  break,  but 
prior  to  the  last  repeatablevexaft  and  the  non-repea table  final 
exam.  ^ 

Some  s^pecifics  as  to  the  operation  of  CGRT  in  chemistry  IO5 
are  in  order  before  the  survey  results  are  discussed.    Seven  exams 
were  given  during  the  fifteen  week  semester.    In  each  case  exams 
were  made  available  at  the  beginning  of  a  two-week  period.  Stu- 
dents were  allowed  up  to  three  "takes"  of  each  exam,  with  only 
the  highest  score  recdWed  for  the  final  grade.    On' the  deadline 
for  taking  each  exam  the  usual  ^op^atable  test  forms  were  adminis- 
tered in  lecture  class.    A  few  students  took  make-up  exams  after 
this  date,  but  this  was  not  common.    Computer  grading"^was 
necessary  because  of  the  large  number  of  exams  being  given,  so 
answers  were  recorded  as  single  characters  on  mark-sense  forms. 
In  cases  where  a  question  was  ambiguous  or  incorrect  answers  were 
suspected,  students  jised  a  "complaint  sheet"  to  request  additional 
credit.    Structural^ormulas  and  other  like  information  were  , 
presented  by  means  of  "hajidouts"  which  were  coded  to  correspond 
with  certain  sets  of  questions. 

With  regard  to  item  number  one  pn  the  survey,  the  pressure 
involved  in  large-scale  testing  is  evident  to  einyone  who  has 
walked  into  a  room  with  400  students* waiting  to  take  a  test.  The 
tension  is  so  high  that  it  often  given  seasoned  teachers  some 
qualms.    Its  effect  on  students  is,  of. course,  much  greater. 
Moreover,  most  of  the  students  in  such  large-seal^' classes  will 
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not  often  be  faced  with  making  decisions  under  similar  condi- 
tions of  stress,  except  while  taking  college  exams ^    We  feel 
that  removal  of  a  portion  of  this  tension  is  beneficial  in  that 
it  improves  students*  attitudes  as  well  as  providing  a  more 
realistic  evaluation  of  their  performance  and  ability, 

More  than  three  quarters  of  the  students  polled  thought  that 
repeatable  exams  helped  them  to  learn  more  chemistry.    Only  six 
per  cent  felt  that  the  exam  system  was  of  doubtful  or  no  pedagogical 
value.    This  is  somewhat  less  than  the  number  who  bad  similar  feelings 
,  about  the  textbook.    While  student  opinions  cannot  be  substituted  for 
experimental  data  on  the  amount  of  actual  learning  taking  place 
(which  we  are  in  the  process  of  obtaining)  it  seems  quite  clear^ 
that  most  students  are  very  happy  with  the  CGRT  system.    This'  is 
also  apparent  in  th^  third  statement  whet^  nearly  eighty  per  cent 
of  the  class  agreed  that  the  system  offered  an  opportunity  to 
demonstrate  their  "real  understanding"of  the  course  material. 

The  fourth  survey  statement  was  included  because  students  who 
were  the  initial  guinea  pigs  for  the  CGRT  system  during  the  1970 
summer  session  complained  that  they  had  too  much  incentive  to   ^      .  " 
study  and  might  therefore  neglect  their  other  courses  in  favor 
of  chemistry  during  a  normal  semester.    It  is  encouraging  to  see 
that  during  the  fall  semester  most  students  did  not  feel  that 
unreasonable  amounts  of  work  were  required  by  CGRT.    Indeed,  1^ 
may  be  possible  to  include  more  material  in-  a  course  without 
student  complaints  when  CGRT  are  used.    That  is,  more  efficient 
learning 'is  taking  place. 

The  next  two  statements  (5  and  6)  were  included  because 
cheating  has  always  been  a  problem  in  large  scale  testing 
'  situations.    While  the  Incentive  to  cheat  js  less  with  OGRT,  it 
sti^l  T^malns.    The  forms  wh^cH  cheating  can  take,  however,  are 
differ«n+  Vc^ause  of  thp  individitr'^1-^y  pf  the  test  fo^s.    Tn  ou>" 
-vp-^-innrp  prop^^r  administration  of  the  tests  car  eliminate  all  ^ 
f'^rms  of  cheating  (or  at  l<?»f?t  all  that  havp  come  to  mind  been 
trl^^d  so  fa**")  except  blatant  polluslon  between  two  stude^tr  1n 
the- exam  room.    This  latter  can  be  detected  easily  by  alert  ^ 
proctors, 

Statemen-^?  7  and  8  in  T^^^le  TIT  indicate  that  most  stndentn 
find  the  repeatable  test  forms  useful  as  study  guides  (or  beha.vtoral 
objec+Wes).    They  also  have  the  valuable  experience  of  consulting 
with  th^ir  peers  when  +hey  have  a  problem.    They  "are  much  more 
likf^ly  to  study  together  because  the  CGRT  system  pits  everyone 
against  the  compute^*,  rather  than  every  jnan  for  himself  against 
everyone  else.    Once  a  student  has  achieved  an  "A"  he  has  an 
"A"  even  if  it  turns  out  to  be  the  class  average.    The  ominous 
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."curve"  is  absent  because  peTrformance  levels  are  set  by  the  exam 
questions,    If, these  levels  are  met  the. student  gets  credit 
independent  of  whether  others  meet  the  same  levels  or  not. 

Statement  9  and  10  reflect  two  other  aspects  of  the  GGRT 
system,  "'In  a  number  of  cases  information  was  tested  which  had 
not  been  presented  in  lecture,  discussion,  laboratory  or  the  text- 
book,   That  is,  a  few  (relatively  minor)  portions  of  the  course  '  . 
were  taught  by  means  of  rppeatable  tests.    There  are  many  aspects 
of  chemistry  which  lend  themselves  to  this  approach.    For  example, 
simple  stoichiometry  problems/  equation  balancing,  chemical  sym- 
bols, valence,  and  some  aspects  of  descriptive  chemistry  require 
considerable  (very  unpopular)  rote  learning,    One  often  washes  to 
bypass  such  topics  in  lectures,  and  our  experience  indicates  that 
this  can  be  clone  using  CGRT,    Students  faced  with  such  problems 
on  ^xams  simply  find  someone  who  can  explain  to  them  how  a  certain 
type  of  problem  should  be  approached,    Having  learned  this,  they 
solve  it  correctly  on  the  second'  or  third  try. 

Statement  10  indicates  that  jfith  careful  framing  of  questions 
a  test  restricted  to  single-character  responses  need  not  be  picayune 
in  its  approach.    Of  course,  as  with  any  type  of  behavioral  ohnective, 
there  is  often  a  tendency  to  emphasize  details  because  they  are 
more  readily  specified.    Nevertheless"  only  about  one-thirl  of  the 
GI05  students  felt  that  the  major  emphasis  lWts  on  details  as  opposed 
to'  general  principles.    It  must  be  reiterated,'  however,  that  ""the 
CGRT  system  is  only  as  good  as  the  questions  that  are  written  fo-^* 
the  item  pool.    It  does  have  the  cidvantaige  that  poor  questions  cart 
be  weeded  out,  but  a  large  number  of  good,  imaginatively  written 
questions  must  be  available.    These  questions  specify,  to  a  much 
frreater  degree  than  in  the  case  of  ordinary  tests,  what  students  will  * 
study  and  leam.  '  ^  *  '  ' 

The  last  two  statpments  in  the  attitudinal  suryey ^simple 
reemphas*^ze  the  suocpss  of  the  entire  course.    Although  other 
factor?  +han^  the  exam  sy?=?tpm  obviously  ^.ffected  these  student 
attitudes,  the  CGRT  svstem  was  responsible  for  much  of  this 
success,  ^ 

Computer  Generated  Repeatable  Testing  work<^.  ^  It  has  be^^n 
used  in  numerous  courses  for  nearly  three  years  at  Indiana  Uni-  ^ 
versity,  and  It       ^Is^  in  use  at  the  University  of  Nebraska, 
Illl»^ols  Institute  of  '^echnolofy,  Indiana  .Mniverslty-Purdue 
University  at  Ind^ananoils,  and  other  places.    The  m^tTTod-has 
been  enthusiastically  used  by  inst7*uctoT>s  6f  undergraduate  courses 
in  such  varied  disciplines  as  psychology,  chemistry,  computer 
science,  economics,  English,  speech  therapy,  home  economics, 
^ccoonting,  and  education,' 


In  general,  studeni^s  have  been  highly  satiafied  irith  the  \ 
repeatable  testing  method*    Their  aood  la  one  of  aleigeieiia 
rather  than  anxiety,    They,  are  relaxed  during  exaaina^ora,  ard 
their  morale  Is  good.    The  'undergraduate  counse'lllng^  units  of 
Indiana  Unlver8).ty  have  received  nuaeroxis  stxident  o^aaents  favor* 
able  to  CGRT. 

An  unexpected  result  'In  sbne  of  the  CGRT  courses  has  b^en  the 
students*  excellent  performance 'on^ technical  material  not  dlscxxssed 
In  class,    Repeatable  examinations  appear  :to  provide  a  stimulus 
and  a  Way  to  master  material  typically  neglected  by  students  In 
conventional  courses »    Although  we  have  only  a  little  data  taken 
under  properly  control^dld  conditions,  Indications  from  several 
common  achievement  tests  given  at  Indiana  University  are  that 
overall  student  achievement  in  repeatai^tly  tested  sections  ^s  higher 
than  in  conventionally  tested  sections  of  the  same  course.     All  . 
available  evidence  suggests  that,  a  system  of^  frequent  apd  repeat- 
able  examinations  provide^  an  excellent  atmosphere  for  scholarly 
activities  of  beginning  students.  ^ 

We  hope  that  aai^  reade2;s  will  wish  to  try.  (XJRT  or  suggest 
its  usei  to  their  non-computer-oriented  colleagues.    The  computer, 
programs  and  ample  documentation  are  available  from  Ftanklin  ^ 
,Prosser, 
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Table  T.    Cost  Analysis  of  CGRT,^ 


ITEM  COST 


Punched  cards  for  it^m  pools  $  #^0     .  ^ 

(one-time  expense) 

Punch  cards  for  student  responses i  ,  1,00 
1000  cards 

»  I, 
Printer  T)aper:    3000  sheets  8. 30  ^ 

Mark  senp«  forms  i    10^  forms  8.80 

Keypunching  services  fo^  i+em,jv)o''  -    -  -6,50 

punching  (one- tine  expense) 

Computer  charges  for  test  production  1^,70 
and  grading!    a'bout  5  minutes  at        '  '  ^  • 

$200  per  hour 

High  speed  line  printer  and  .controller  6.00  to  $l^.no 

rentj^l  and  maintenance:  .at  $l800 
per  month 


Thtal  expenses  .$^7.70  to  $53.70 


AVmoiJ^  OO^^T  P^^  TfiSTi    ii.R  cents  to  5>  cpnts 


^fp-r  looo  threp-pa^e  tests 

TMIana  Unlv^^sity  CDC  36OO  system 
'CDC  Sl^  printer  system 
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•  f^if  'il. . :  C<3st  /jialysls . of.' CionVentlortaiaLy.  .Pwpwfecl  ..Tests , 


COST- 


ER 


*  \^ipe55j'i  :3000  sXi^is 


^!<irk  8enS>  fonts  i,.  i000\fbrn8 


.fSveo  * 


-■\^Col34i4ttg'a|jd.  atitpi^  p       *  .  IB. 00 

^Coftputer■'6^k^giis  for  gxs^ingt   %L*i0^"t  1.-^  '  *3*30  ^ 

.  minuie^at,$200;per""hbur     ■'.       ^  ... 


AVERAGET.  CQgT  ^PER-.TEST't  '*5.*  1  'cent? 


^for  'lOOO  ihr^e-paga  tests 


e^tJLiuates  Supplied  by  IndiaJia  University 
Chemistry  l)0partnjeht .    .         ,  . 


'Indiahii  University  Cr)C  36OO' system 
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Table  III.    Results. of  Attitudinal  Survey 

1.  To -what  degree  do  you  feel  that  repeatable  exams  remove  the, 
pressure  involved' in  achieving  a  good  grade? 

Very  greatly  57.0056    Somewhat       IjK   Actually  increase 
Considerably  25.506    Not  at  all    3.00^.  pressure  1.50^ 

2.  Do  you  feel  that  a  repeatable  exam,  of  itself,,  has  h^ped  you 
actually  learn  more  material?         ^  ,  - 

Very  .definitely    77.5096    Don't  really     Doubtful  2; 00^ 

Probably  16.0096   know     '^.0096   Definitely  not  2,00^ 

3.  ,The  repeatable  exam  system  gives  me  an  opportunity  to 

demonstrate  my  real  understanding  of  the  course  material. 
A^e  strongly      38/00^     Disagree  moderately     8.75^  ^ 
Agree  moderately   ^1.00^     Disagree  strongly        4,0096  . 

4.  The  repeatable  exam  system  is  just  another  device  to  get  stu- 
dents to  do  unreasonable  amounts  of  woric  for  the  credit  given, 

Agree  strongly  '  1.00^  Disagree  moderately  15,00^ 
Agree  moderately    3.00^      Disagree  strongly  77.00^ 

5.  A  student  should  •'feel  less  compulsion  to /Tjeat  the  system" 
^  when  repeaiable  fexams  are  used.  .        ,  " 

Agree  strongly.'    ^  45. 50^-     Disagree  moderately  7.5Q^ 
Agfee  moderately  '   32.25*      Disagree  strongly  3.00^ 
«      .  •  *  -  '  •  * 

6.  Do  you  think  it  is  possible  to  ."beat  the  system"    when  computpr- 
generated  repeatable  exams  are  em^ployed? 

Very  definitely  -     8.00%   Wo.uldn*t  know     37.00    Very  definite 
s  Probably        ^        ^  31.0\)%   Probably  not       19.00    ly  not  ^.00* 

?,    Do  you  find  that  keeping  the  exams  and  usinp;  the  answer  key 
serve  as  a  good  guide  to  remedifil  study? 

Very  great  help  86.00*  No  help  Only  confuses  me  0* 
Some  help;  i2.00*'    2.0£)*       Wors^  than  nothing;  Q* 

8.    Do  you  find  that" discussing  one  another^ s  exams  increases 

yonr  mastery  as  well  as  bringing  about  a  change  in  your  score? 

Very  much     71.00*    Not  at  all   '       ^.00^  Worse  than  no 

Som'flyhat       ZU.0Of->    Only  confuses  me    1,0C^  discussion  0^ 

0,    The  exams  assume  'too  much  chemical  information  not  pres6n;ted 
in  C105.      '   ,  .   .    '    .  ' 

Agree  st]5:ongly      5.00*     Disagree  moder^utely  :  38.50*- 
Agree  moderately  28.25*     Disagree  stron/=^ly  19^00^ 
No  position  8.50* 
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10.  The  emphasis  in  G105.is  on  details  rather  than,  matters  of- 
general  importance  in  chemistry. 

Agree  strongly      7.003?     Disagree  Moderately  39.00^ 
Agree  moderately  25.50??     Disagree  strongly  19.00^ 
No  position  10.00^ 

11.  Since  taking  0105,  my  interest"  is  chemistry  has i  , 

Increased  greatly  29.00^     Remained  unchbiged     21.50^  * 
Increased  slightly  ^+2.255^     Decreased.  6.25^ 
Terminated    1.0^  • 

12.  The  CI05  course  as  an  intellectual  challenge  isj 

Very  great  •  12.005?'  Average  Below  average  k.OO^ 
Above  average    58. 5(^      2^^.50^      'F^r  below  average  1.00^ 
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The  Effect  of  Computer^  on  the  Teaching  of  Physical  Chemistry 

»  By  Morris  Bader 

Moravian  College 

Physical  Chemistry  is  a  subject  ideally  suited  for  interaction 
with  a  computer.    The  course  itself  is  an  investigation  into  the  nature 
of  things  with  emphasis  on  the  l^ws  of  nature  which  lend  themselves 
to  a  mat'hematical  analysis  of  natural  phenomena.    So,  fp^r  example, 
the  Clausius-Chapeyron  equation  tells  us  how  the  vapor  pressure  of  a 
liquid  varies- with  temperature;  the  Van  Der  Waal's  equation  tells  us 
how  real  gases  behave  under  extremes  of  pressure  and  temperature,  . 
and  even  su.ch  esoteric  equations  as  that  of  Schrodinger  tell  us  how 
electrons  behave  in  atoms  and  molecules.  • 

In  the  past,  we  college  professors  thought  we  were  teaching 
gobd  chemistry  by  having  the  student  plug  numbers  into  equations  ad 
infinitum.    Today,  more  is  needed  and  more  is  now  possible.'   It  still  :-\ 
•  behooves  a  student  to  understand  mathematics  but  we  would  also  li^e  him 
to'  understand  the  underlying  concepts.    Here  is  where  tHe. ^computer 
plays  a'most  impdrtant -role.    The  student,  and  advanced  .student's' 
at  that,  who  may  never  know  how  to  solve  SchrogllLngei^'s  equation. j£o':p*th^*"' 
electron  density  in  a  molecule  might  still,  undejrjst.and  the 'concepts  '^ '.V 
-  ''involved  if  he  could  be  led  to  ask  tho^e  qtrestions  whi^b^; explore  '  . 
areas  of  such  mathematical  complexity  that.  4-^^  tibitfd       sens-eless  to 
•pursue  that  route.    The  computer  program  HMOLC     written  at  Moravian 
College  for  the  IBM  1130  computer  allows^  tpe-'-student  to  ask. the  computer 
to  construct  a  plot  of  the  electron  density  axTQund  2  atpmsT' at  any 
distance  of  separation,  for  any  type  of  el^ctt(?n_Conflgura'tion^  That 
picture  is  worth  a  thousand  lectures.    On  a  course  devoted  to  principles 
one  must  always  be  on  guard  to  bailee  ^the  hfeavy  theoretical  with  the 
manifestly  practj(|Cal.'  ^  ^ 

The  laboratory  is  one  place  where  p'edagogic^l  breakthroughs  can 
be  made  With  a  computer.    First,  routine  calculations  can  be  tatally^ 
eliminated.    The  impact  at  this  .on  th"e  student^is  amazing*    It  encourages 
the  student  to  become  more  pi{ecise  *ih  his  collection  of  experimental'  . 
data,  because  it  becomes  ratVier  obvibus.when  final  results  ate  computed 
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where  he  has  been  lackadaisical.    The  student  is  also  encouraged  to 
take  more  data  since  he  knows  that  the  tedium  of  calculations  is 
no  longer  liecessary.    For  example,  the  program  DERIV  will  accept 
any  typ^  of  analytical  titration  data  such  as  pH  vs  mis,  and  automatically 
plots  the  data,  and  then  plots  both  the  first  and  second  derivative. 
All  instructors  know  it  takes  a  student  literally  hours  to  plot  one 
roxi  while  the  computer  does  this  in  seconds.    Before  the  computer, 
the  tedium,  of  the  plots  completely  counteracted  the  intejit  of  the 
learning  situation,  that  is,  a -student  learned  nevfer  to  do  a  pH 
titration  ever    again.    It  ±s  not  necessary  to  mention  that  DERIV  can 
expand  the  region  of  the  equivalence  point  to  offer  mathematical 
accuracy  more  than  ten  times  t^e  experimental  limitation.    So,  not  % 
only  is  tedium  removed,  but  all  necessary  scientific  ingredients  al^o 
improve.    Occasionally,  workup  of  lab  data  is  so  rugged  an  instructor 
recoils  from  attempting  such  an  experiment.    An  example  of  this  would 
be  the  experiment  in  partial  molal  volumesy  of  1:^^^^^  mixtures.  Weight^' 
mus.t  be  corrected  for  air  buoyancy,  a  pyknometer  (liquid  density 
device)  must  be  calibrated,  der\3ities  and  molecular  weights  must  be 
convetted  to  concentration  units,  and  a  frightening  list  of  Qther    ^     ^  . 
calculations  foUpw.    The  program  MOLRV  does  all  these  calculations 
and  completes  the  job  with  the  final  plot  of  the  molal  volumes.    While  ♦  ^ 
it  is  expected*  that  a  student  understand  thei  calculations  and  even 
^how  samples  in  his  lab  report,  the  computer  emphasizes  the  scientific 
^pects  of -the  laboi^atory.^ 

\  Finally,  we  have  the  area  of  scientific  research.    Can  a 

perW.in  ^  small  college  with  a  small  computer  hope  t5  cpmpete  with 
the  giant  \iniversities  with  t;heir  giant  machines?    The  answer  is  a  definite 
yes!  '  ^  ■  ■ 

^  '  At-  Moravian  College  we  have  been  .able  to  do  fwndam^taL  research 

in  , the  molecular  structure  of  diatdmic  molecules.    The  program  MOSTR  ^ 
accepts  data  from  the  infrared  absorbtion  spectrum*  of  a  diatomic  molecule 
in  the  gas  phase  and  computes  ^a  number  of  molecular  parameters  in  the 
ground,  first,  and  second  excited  states  including  the  anharraonicifty 
constant.    This,  program  h^  beeh  included  in  the  review  article     on  ^ 


"Energy  States  of  Molecules"  as  the  state  of  the  art.    Such  a 
commendation  acts  as  an  enormous  plus  in  an  instructor's  bag  of  tricks. 
A  set  of  4  programs  with  research  capabilities  have  also' been  published 
by  this  author  ^.    These  include  an  NMR  Simulation,  a  Huckel  Molecular 
Orbital  Calculation,  and  Atomic  arjd  Molecular  electron  density  contour 
diagrams,  all  written  for  the  small  computer ♦ 

To  summarize,  physical  chemistry  can  no  lori'ger  be  taught  with 
out  the  computer  as  an  integral  part  of  the  subject.    Hopefully ,; with 
imagination  and  perseverence  other  discipi^pLnes  might  find  themselves 
in  a  similar,  exfcited  state.  ' 


ERIC 


9-43 


References 


1.    "Computer  Programs  dn  Undergraduate  Chemistry",  J,  Chem..  Ed., 
48,  175,  (1971).       '  .  *.    "         /*  .  \  ^ 

2;    'lA  Computerized  Physicai  Chjemistry  Expetiment*' ,  J.  Chera*  ^d.  , 
46,  209,  (1969).      \  .     ,         '  ^  ,        ^  . 

3.  '  Quantum  Chemistry  Program  Exchange,  Program  No.  164,  "MCJSfR  , 

Molecular  Parameters'  of  Di^toWic  riol^cules",  Indiana  UniVfersity, 
Bloomington,  Indiana,  Jan.,  (1971). 

4.  "Energy  State's  of  Molecules",,  J.  Chem.  Ed;,  47,  2,  0970). 


4  or 


ERJC. 


9-U 

A  HYBRID  APPROACH  TO  COMPUTER  ASSISTED  INSTRUCTION 

By 

F,  D.  Tabbutt 
Reed  College 

Portland,  Oregon  97202 

Although  analog  and  digital  approaches  to  assisted  Instruction 
have  complementary  qualities^  thoir  presentations  are  not  always 
complimentary  to  each  other.    The  analog  computer  users  listen  to 
the  new  and  old  methods  of  calculating  and  displaying  digital 
solutions  to  the  damped  oscillator  with  a  certain  superiority  that 
stems  from  the  knowledge  that  they  can  do  it  so  much  more  quickly 
and  inexpensively  on  an  analog  computer.    The  digital  computer  user 
has  a  similar  superiority  when  listening  to  the  papers  on  analog 
CAI,  for  he  knows  that  although  t;he  analog  medium  is  highly  intern- 
active,  the  choice  of  problems  is  extremely  limited.    In  fact,  in  . 
this  day  of  sophisticated  electronics,  the  stone  age  image  which 
the  analog  patch  panel  presents  whether  it  be  time-shared  or  not, 
i^  not  likely  to  entice  very. many  new  members  into  the  clyb. 

These  observations  are  not  new.    Many  have  seen  and  probably 
experienced  them.    Most  people  involved  in  CAI  have  recognized  the 
chasn)  and  the  sometimes  wasteful  competition  between  the  two  basic 
types  of  computer  instruction.    Most  have  also  been  aware  of  hybrid 
systems^  wnich  if  properly  designed  can  take  advantage  of  the 
superior  qualities  of  both  systems."  This  paper  is  a  report  on 
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progress  along  the 'next  obvious  stop  — tima  sharing  a  hybrid 
computer  for  instructional  purposes. 

General  Objectives  and  Design  of  the  System 
The  primary  objective  of  this  instructional  system  has  been.^ 
interactive  graphic  flexibility  in  dealing  with  mathematical ^ 
topics.    To  achieve  this  end  the  system  was  to  minimize  the  need  . 
for  language  format  and  programming  on  the  part  of  the  liser,  and 
to  maximize  speed  of  response  and  graphic  quality.    Moreover,  it 
was  anticipated  that  the  bulk  of  the  topics  to,  be  treated  could  . 
be  represented  tiy  differential  equations. 

Theoretically,  at  least,  the  design  of  a  hybrid  system- to  . 
accomplish  this  is  reasonably  straightforward.    First,  the  duty 
of  solving  and  plotting  the  differential  equations  is  assigned 
to  the  analog  computer  —  the  obvious  choice  for  reaso'ns  of  spe^ii. 
and  graphic  quality.    To  the  digital  computer  is  assigned. the  role 

V 

of  controlling  the  time-sharing,  i.e.,  servicing  the  time-shared 
terminals.    However,  to  deal  with  any  differential  equations  t^iat 
any  user  might  specify,  it  will  be  necessary  for  each  user  tp 
have  the  analog  computer  patched  to  his  speci filiations  for  hi^^ 
problem.    Speed  limitations  alone  rule  out  patch  paitels  at^ 
\^rminals;  the  alternative  is  an  automatic  patch  Cord  mechanism 
or  autopatch.    Autopatch  can  6.e  achieved  with  a  large^^ntunber^f  . 
high  speed  switches  which  are  each  under  control  of  tHe  digital 
computer.    At  this  stage  in  iJ^e  design,  then,  a  user  would  have 
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to  write  out  the  analog  program  for  his  problem,  look  up  the 
switch  addresses  to  implement  it,  and  then  instruct  the  digittl 
computer  to  close  then«  "  This  is  messy  and  hardly  meets  the  goal 
of  freedom  from  programming  and  language  format.  Fortunately, 
there  is  another  way. 

Those  who  are  familiar  with  the  general or  bootstrap 
method  of  solving  differential  equations  with  an  analog  computer 
must  recognize  the  mbchanical  nature  of  the  approach:    1}  isolate 
the  highest  derivative,    2)  integrate  and  manipulate  uriti^l  all 
terms  equal  to  highest  derivative  havi9  been  generated,    3)  close 
loop,    4)  set  initial  i^onditions.    Given  some  specific  complement 
of  analog  components  and  the  switch^'^mths  between  them  it  would 
seem  quite  feasible  to  have  the  digital  computer  step  through  an 
algorithm  to  determine  wh^t  is  to  be  connected  to  what  and  then, 
with  the  control  it  has  over  the  switches,  to  make  the ^proper 
connections.    In  summary,  then,  if  we  have  the  autoprogramming 
capability  just  describett,  and  the  autopatch  capability  described 
earlier,  it  would  then  become  a. simple  task  to  have  the  digital 
computer  recognize  an  alphanumeric  statement  of  a  differential 
equation  and  solve  it  on  \he  analog  computer. 

However,  there  yet  remaiii  two  hurdles  to  be  cleared  before 
this  system  is,  theoretically  ^t  least,'  complete.    Parameters  must 
be  .adjustable  for  each  terminal.    There  ^re  a  (lumber  of  ways  of 
"doing  th3^s.    Potentiometers  can  be  arranged  at  each  terminal  and 
switched  in  and  out       dictated  by  the  digital  contputer  controller 
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Alternatively,  servo  set  pots  at  the  computer  could  be  controllecf 
by  the  terminal.    Yet,  for  operation  at  speeds  which  the  modem 
analog  computer  are  capable,  e.g.,  0,2  msec  initial  condition, 
1.0  msec  operate,  neither  method  is  satisfactory.    The  phase 
shifts  in  remote  pots  attendant  with  frequencies  to  be  expected 
at  these  speeds  ('v^lO  KHz)  void  the  former  possibility.  The 
sluggish  response  time  at  servo  set  pots  eliminates  them.  Since 
autopatch  requires  so  many  coefficient  devices,  we  can  only 
realistically  consider  time-^sharing  cpefficient  devices  which  can 
be  set  in  less  than  0.1  msec.    Such  coefficient  devices  do  exist: 
they  are  digital  to  analog  converters  capable  of  multiplying  an 
analog  variable.    We  will  call  them  MDACs  for  short. 

Finally,  there  is  the  annoying,  albeit  important ,^  require- 
ment  to  communicate^  user  desires  to  the  digital  computer.  A' 
keyboard  would.be  the  simplest  way,  but  a  character  generator  which 
would  enable  a  di^splay  of  the  statement  of  the  equations  is  required. 
A  stroke  generator  with  analog  output  would  be  preferable  to  be 
compatible  with  the  output  from  the  analog  computer. 

An  overall  schematic  of  what  has  just  been  described  is  shown 
>  in  Figure  1.    The  hardware  and  software  details  of  this  system  are 
described  in  the  next  section.    Following  that  is  a.  section  which 
summarizes  the  system  and  points  to  some  uses'. 

'       '  "  • 
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The  Construction  of  the  System 


Autopatch 


Several  methods  of  autop&tching  an  analog  computer  have  been 
(2-51 

proposed.  More  sophisticate'd  approaches  have  been  a  telephone- 

like trunking  system  to  reduce  the  number  of  switches  involved. 
Without  trunking,  the  number  is  simply  the  product  of  the  number 

of  inputs  (1)  and  outputs.    Since  in  analog  computing  there  are 

2 

about  twice  as  many  outputs  as  inputs  about  21    switches  are  re- 

*  15 
quired.    Sophisticated  trunking  can  lower  this  at  best  to  I  *  . 

However,  if  one  deals  with  a  small  number  of  analog  Components; 

sa^l5,  then  if  all  outputs  can  be  connected  to  ^11  inputs  only 

< 

450  switches  are  required.    Furthermore,  if  coefficient  devices 
(MDACs)  are  dedicated  to  each  output  so  that  they  need  not  be 
counted  as  a  separate  component,  the  15  components  can  represent  a 
respectable  computing  ability .    In  fact/  with  the  18  components  ^ 
shown  in  Figure  2,  it  was  felt  tnat  we  would  have  a  reg^sonable 
start  at  autopatch  with  the  capability  of  solving  the  usual  range 
of  equations  to  be  encountered  at  the  freshman  or  sophomore  level.*  , 
Furthermore,  the  job  of  the  programmer  designing^  the  autoprogratmning 
feature  was  simplified  by  avoiding  the  added  complication  presented 
Dy  trunk  net^^orks  between  output  and  inputs.    Consequently,  as 
.shown  in  Figure  2,  all  outputs  had  direct  connections  available  to 
all  inputs  whith  is  the  method  used  by  Howe,vet.  a}.^^^    Thus,  our 
initial  autopatch  system  contains  five  integrators,  three  multipliers 
one  divider  and  tnree  summers,  and  thirty^ MDACs  connected  together 
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by  540  switches.    A  thirty-first  MDAC  sets  initial  conditions. 

A  great  deal  of  effort  wentinta  the  design  of  the  analog 
switch.    It  had  to  have  a  low  phase  shift  (<.056*  at  10  KHz)  for 
acceptable  accuracy  in  high  spted  analog  computation.    Furthermore,  . 
when  the  switch  was  opon^  less  than  .1%  could  feed  through  at  IC  KHz. 
The  switch,  which  ^as  finally  used,  is  showi},  in  Figure  3.    It  has  ^ 
a  primary  and  a  secondary  me»ory  in  the  quad  latches  that  control 
each  swit'^s^   Consequently,  while  one  problem  is  running,  the  digi- 
tal con^uter  can  be  setting  the  quad  latches  for  the  next  problem 
and  when  the  time  comes  the  computer  can  set  all  540  switches  in  a 
single  command.    The  switch  and  the  series  resistance  are  adjusted 
to  an  Accuracy  of  0.1%  so  that  all  outputs  go  through  switches 
directly  to  summing  junctions  of  input  amplifiers.    The  summing 
capability  of  such  a  scheme  guarantees  that  we, have,  in  fact,  a 
summing  capability  which  is  greater  than  the  three  summers  would 
lead  one  to  believe.    The^switches  are  assembled  in  groups  of  six- 
teen  on  a  single  printed  circuit  card.    These  sixteen-switch  cards 
are  inserted  in  a  card  cage  alon^  with  communication  cards  which 
c^trol  thep  and  the  whole  assembly  is  then  mounted  oi)  the  front  of  . 
the  patch  panel  which  is  dedicated  to  Autopatch. 

With  tho  advent  of  tRe  MDAC,  the  last  moving  part  in  the  analog 
computer  is  Replaced.    That  is,  we  c^  now  take  fuM  advantage  of 
the  tremendous  analog  speed.    The  MDACs  used  in  this  system  are 
simply  an  array  of  precision  resistors  with  solid  $tate  switches/ 
The  ones  used  in  this  system  are  bipolar  and  have  12  bit  accuracy. 
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They  are  double-buffered,  i.e.,  ^ey  have  a  primary  and  secondary 
storage,  so  that  like  the  swit^j^  they  can  be  prepared  for  the 
next  problem  while  the  present  one^is  ruxming  and  then  all  set  with 
a  single  command.  ,  *  - 

We,  t-Kereford,  have  an  autopatch  system  in  which  a  problem 
can  be  patched  and  the  coefficients  ,*set  in  two  poBOiands^or  about 
20  u  seconds  with  a  Nova  digital  conputer. 
Character  Generator  *  ^ 

The  characte-r  generator  is  basically. an  analog  function 
generator  for  the  X  and  Y-axis,    As  s^dwn  in  Figure  4a,  inputs  of 
+,  0  ot  -  reference  voltages  can  be  switched  into  an  integrator 
during  the  period  of  a  clock  pulse  to  generate  a  negative  ramp,  a 
horizontal  bar  or  a  positive  ramp  respectively,    X  sequence  of  32 
clock  pulses  ^tivates  *in  turn  32  different  control^lines  and  then 
repeats.    The  32  control  lines  are  connected  to  a  gated  decoder  for 

.X  and  Y  and  the  \  va  t  and  Y  va  t  patterns  shown  in  Figure  4b  are 

^     ,   .         ^  ■  O 

obtained.    An  X       Y  plot  produce^  the  image  shown  in  Figure  4c. 
The  intensity  or  Z  -axis  sequence  will  determine  which  stroke  is 

unt>lank0d  and  thus ^ what  the  character  will  lo<Jk  like.    Th^  z  axis 

f 

sequence  is  produced  as  follows.    The  32  control  lines  mentioned 

above  also  each  go.  to  an  array  of  32  gates  which  tire  **Anded*»  with 

the  output  from  a  re ad -oh ly -memory  (diode  board) .    The  read-only- 

I 

memory  is  actj^vated  by  a  seven  bit  ASCII  string  from  the  digital 
computer.    Two  eight  bit  strings  are  required 'to  set-  the  DACs  for 
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the  X-Y  position  of  the  character.    Four  more  bits,  control  the  four 
^  switches  for  input/output  impedances  of  the  last ' operational  ampli- 
fier.   This^inables  the  digital  computer  to  select  16  character 
sizes  and  the  spacing  will  be  automatically  adjust<^d.    The  digital 
computer  need  only  devote  the  time  reqliired  to  send  two  command 
•  words  ('^'10  M  seconds  for  th^  Nova)  to  produce  a  character.    The  speed 
with  which  the  character  is  generated  then  depends  jon  the  clock  rate. 
Characters  can  be  generated  as\  quickly  zs  32  mi|^osecdnxls  MHz 
rate)  although  for  a  storage  oscilloscope  display  0.64  milliseconds 
(50  KHz  rate)  is  the  fastest  rate  possible  for  storage.    At  present 
one  character,  generator  services  all' terminals  ; 

Interface  .  '     '  '      ^  ' 

•The  details  of  jthe  keyboaifds  -  CRT     analog  computer  (AJ)/5)  - 
charact|Br  generator  -  digital  computer  (Nova)  and'^ analog  swit.ches 
,  interfacing  are  summarized  in  Figure  5.    Two  interfaces'  O/F)  are 
mounted  on  an  accessory  card  inside  ihe  Nova  computer.    The  32  bit  > 
I/F  effectively  controls  what  the -user  will  see  while^the  12  bit 
I/F  brings  in^he  data  from  the  keyboards.    In  addition  to  the  27  ^ 
'  of  the  32  bits  identified  in  the  character  generator. section,  four 
are  for  the  address  of  the -terminal  and  one  is  an  erase  pulse  for 
\       the  storage  oscilloscope.    The  character  generator  has  an  analog* 

and  a  logic  switch  system  to  determi-ne  whether  the  X,  Y,  and  Inten- 
sity signals  come  from  the  AD/S  or  the  character  generator.  A  \ui)  , 
comman^from  the  Nova  sets  the  switches  for  character  generator 
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outputs,  otherwise  AD/5  outputs  go  through.    After  the  Era^e 
commaxi4  has-been  given  nearly^  0.5  se^cond  i^  required  before  the 
storage  oscilloscope  can  be  used  again.    Busy  is  an  ii^coming  bus 
from  a  flip  flop  in  the  oscilloscope  vrhich  is^high  only  during  this 
interval.    The  Nova  can  then  test  the  Busy  bus  periodically  rather 
than  usp  computer  time  to  count  the  0.5  seconds.    The  selection  of 
th«  terminal  with  access  to  the.Datk  bus  occurs  as  follows.  When 
$  uiter  has  a  character  'loaded  into  the  keybbard  register,  then  the 
next  time  the  50  KC. Line  Available  bus  is  high  that  terminal  will 
stop  the  Line  Available  signal  at  the *NoVa  and  then  clock  out  the 
seven  bits  ori  the  Data  bus  until  th6  software  sets  Done  at  which 
time  the;  Liae  Available  is  activated  again. 

The  interface  between  the  Nova  and  the  AD/5  has  a  powerful 
poramunication  linkage.    It  is  this  interface  which  sets  the  MDACs. 
Moreover,  a  cable  containing  16  data  lines  and  6  command  lines 
Gohriects  this  interface  to  the  ai*r^y  of  analog  switches.  The' 
"s^witches"  are  set  with  a  •seirial  bir  ^string  at  a  cost  of  one  bit/ 
switch.    Thus, -once  initiated,  each  digital  word  sets  16  switches.^ 

*.Text  and  an  unexpected  audio  dividend 

While  the  terminal  has  tne. graphic  capability  to  dispWy  teoct 
at  a  cost  of  2  cHgital  words/alphanumeric  character,  the  8K  memory 
of  the  Nova  will  allow  only  a  limited  amount  of  text  to  be  stored 
in  core)    Yet  it  would  be  helpful  if  an  instructor  could  compose 
a  CAI  lesson  which  would  consist  of  passages  of  text  alternating  ^ 


witfi  computational  computer  graphics  under  contrc^l  of  tl>e  user. 
However,  as,  t^  library  of  these  lessons  increased  in  number  it 
was  clear  that  they  could  not  all  be  available  from -core. 

,  An  alternative  was  to  record  the  texXon  magnetic  tape  cassettes. 
Each  terminal  would  then  be  provided  with  a  playback  device.  When 
the  "cassette  recorder  is  connected  to  the 'keyboard  and  playback 
^  initiated,  the  digitfil  ji formation  ^toing  from 'the  tape' appears  to 
the  Nova  as  a  very  rapid  typist.    The  magnetic  cassette  has -large 
storage  capability  so  that  any  number  of  lessons  could  be  maintained 
in  ^  library.    All  a  student  need  do  ^  pick  out  the  cassette  with 

the  desired  lesson  and  fast  forward  to  the  counter  reading  where 

t  <■ 

-the  lesson  begins . 

{  • 
The  cassette  player  is  a  comnercial  device  which  has'  dual 

channels  for  stereo  use.    One  channel^is  committed  to  digital  in- 

formfeion  whith  still  leaves  the  other  for  audio.    This  unexpected 

dividend  makes  it  possible  for  the  CAI  lesson  to  have  graphic  and 

audio  presentation  simaltaneously  for  little  additional  cost,. 

Hardcopy 

A  useful,  although  sometimes  expensive,  feature  of  a  graphics 
terminal  is  the  ability  to  give  hard' copy.    This  can  be  done  in  a 
straightforward  manner  with  the  circuit  shorn  in  Figure  6.  The 
circuit  samples* a  repeating  analo'g  signal  and  slowly  moves  the 
sampling  Jwindow  across  it  without  ever  affecting  the  speed  of  the 
systemHt  is  sampling.    By  doing  ^his  to  both  X  and*Y  coordinates 
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it  is  possible  for  an  X-Y  recorder  to  record  in  several  seconds  a 

graph  which  has  been  calculated  in  1  msec  several  thousand  times. 

The  hardware  for  this 'sampling  system  is  available  on  the^AD/5  and 

can  be  permanently  patched  on  the  Autopatch  panel.    Presently  there 

is  a  single  sampL»ing  bus  on  the  system  and  a  single  'X-Y  recorder 

that  can  be  used  at  any  terminal. 

Software 

:      An  important,  if  not  crucial-,  aspect  of  writing  the  software 
for  the  system  was  the  ability  to  write  it  on  cards  to  simplify 
editing.  ,  This  was  particularly  true  for  the  Autoprograjnming  section. 
However,  our  Nova  only  had  a  teletype-paper  tape  input.  Accordingly, 
some  time  was  spent  initially  writing  an  emulator  for  the  Nova  on 
tne  I^M  1130  which  resides  at  the  Reec^i^llege  Computation  Center, 
rhus  virtually  all  programs  were  written  and  tested  on  the  1130. 
Only  then  were  the,  cards  converted  to  paper  tape  and  loaded  into  ^ 
the  Nova. 

An  overview  of  the  software  is  given  in  Figure  7.    There  are^ 
.^^'effectively  two  different  functions  being  performed  by  the  CPU 
with  buffers  as  a  link. 

.Consider  the  following  example.    Suppose  that  Control  is 
having  Autoprogram  *work  on  someone  *s  problem  when  sojneone  at  another 
terminal  depresses  a  keyboard  key.    This  causes  a  Control^ Interrupt 
due  to  a'Cha^act^r  coming  in.    This  is  the  highest  priority  so 
Control  p^ses  while  the  character  is  entered  ipto  the  200  word 

t 


Character  buffer  allocated  for  that  terminal;  then  Control  resumes 
its  job.    Whfen  the  character  stream  for  enteriW  the  problen^by  the 
user  is  complete,  he  depresses  a  control  )?ey  followed  by  A.    The  * 
next  time  that  terminal  is  serviced  the  characters  in  the  Character  • 
buffer  will  undergo  an  Auto'progrtm.    When  the  Autoprogram  is  co^- , 
pleted  the  Character  buffer  now  contains  the  -autopatch  switch  settings 
and  the  buf'fjsr  is  transferred  to  a  ISO  word  Plot  buffer  for  that 
,   terminal.    Control  tSen  requests  the 'temp-nal  user  to  ftssign  values  p" 
1  for  each  coefficient  identified  in  the  problem.    These  inputs  are 
identified 'by  the  contrdl  key  followed  by  C  and  are  entered  in  the 

Character  buffer.    When  Coefficient  sees  that  all  coefficients  have 

,      /  • .        *  -         ^  . 

been  given  a  value  it  stops  asking,  transfers  »thfe  buffer  to  the 

t  Plot  buffep  ^d  awaits  a  run  command.    The  user  , initiates  thi's  by 

depressing  the  control  key  followed  by  R.    The  next  time  Control  is 

interrupted  by  the  AD/S  signifying  that  it  is  ready  and  it  is" the 

turn  for  thjis  temiinaU  the  Plbt  buffer  will  be  transfer^d  to  the 

AD/5  and  then  a  command  given  to  computer.    The  run  commahd  can  also 

cause  Rese't  to  clear  tioth  buffers  for  that  terminal. 

*  • 

...  /  ^ 

^Summary 

A  hybrid  CAI  system  has  been  described  which  has  the  inter- 

*  * 

active  capacity  of  a  digital  computer  and  the  speeH  of  an  analog 
computer  with  an  auHio  capabilit;y  thrown  ih.    It  would  appear  that^ 
such  a  system  offers  great  potential.    It  can  service  a  large  number 
of  users  solving  any  number  of  differential  equations  at  a  speed 
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which  has  not  been  achieved  before.    Thjls  hybrid  time-share  system 
/'is  a  small  step  in  a  direction  Wh^^  could  lead  to  a  fruitful 

partnership  of  digital  and  analog  techniques. 

It  is  too  early  tp  tell  at  this  writiiig  what  combination  of 

text",  graphics,  and  audio  is  best.    It  is  too  early  to  say  how 

effective  thi$  system  will  be  as  an  instpnictional  device.  Specific 
.  example? -of  the*.use  of  the  system  will  be  given  at  the  presentation 
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OVERVIEW    OF  SOFTWARE 


CHARACTER 
IN 


AD/5  or 
dHAR.vGEN. 


Characfer 
Buffer 


CONTROL 
INTERRUPT 


/ 


/ 


/ 


/ 


/ 


/ 


/ 


AUTO- 

PROGRA  M 

 A_ 

 ,_  


\ 


\ 


\ 


CONTROL 


7\ 


RESET 


Plot 
Buffer 


\ 


\ 


\ 


^1 


\ 

AD/5 

\ 


\ 


\ 


\ 


COEFFICIENT 




ERIC 


9-U 


Laboratory  Simulation  and  CAI:   A  Rationale 


i 

M.  T.-  Muller 

Computer  Based  Instuuctibnal  Systems,  Inc< 
Austin,  Texas 

Introduction 


The  use  of  computer  modeling  and  simulation  during  the  past 

twenty  years  evolved  initialfythrough  mij.itary  applications  such 

as  war  gaming  and  human  operations  systems  research  efforts. 

Within  the  past  ten  years  heavy  emphasis  on  the  use  of  simulation 

for  training  purposes  in  the  space  flight  program  has  resulted  in 

the  development  of  sophisticated  instrumentation  that  was  designed 

*for  computer  compatibility  for  real  time  applications.  The 

resultant  "spin  off"  of  many  of  these  simulation  techniques  and 

associated  devices  have  been  of  a  major  benefit  to  commercial. 

V 

incjjistry  in  adaptation  to  in-house  training  programs.  Examples' 
of  the  use  of  simulation  arc  the  lunar  module  flight  simulator 
which  allowed  the  astronaut  to  achieve  a  high  degree  .of  flight 
proficiency  before  the  actual  space  tlight. 

m 

Despit?e  all  of  the  progress  achieved  within  the  spav-c  program 
new  fnnovative  techniques  in  chemical  education  have  been  limited 
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to  traditional  use  of  multi-media  and  more  recently  to  computer 
based  instruction.    There  .is  a  bright  note  m  this  because  many,  of 
the^imulation  l.echniques  so  successfully  used  in  the  space  pro'gram 

■  \ 

can  be  usefully  translated  and  applied  to  the  field  of  laboratory 
instruction  in  ch-emistry.    The  purpose  of  this  paper^'is  to  discuss 
in  so^me  detail  .the -practical  aspects  of  the  use  of  computer  based 
simulation  i^)  in  undergraduate  coUeg^'e  chemistry  laboratory 
instruction.    The  major  objective  of  ar>y  teaching  technique,  be  it 
traditional  or  innovative,  is  to  make  student  learning  a  more 
meaningful  and  enriched  experience  in  education. 

Laboratory  Simulation 

It  is  commonly  accepted  that  the  role  of  laborat^ory  work 
» 

in  undergraduate  chemistry  courses  is  to  provide  the  student  with 
opportunities  (a)  to  use  equipment  and  gain  some  decree  of  exper- 

c 

ience  with  laboratory  techniques,  and  (b)  for  decision-making 
based  on  experimental  data  as  well-as  for  manipulation  of  these 
data.    Thus,  a  technique  .may  be  taught  in  the  laboratory  and  then 
belabor,ed--apparently  endles sly--to  show  the  student  how  it  may 


1.  A  definition  of  computer  simulation  is  quoted  from 
"Computer  Modeling  and  Simulation"  b/        F.  Martin,  John  Wiley 
and  Sons,  New  York,.  1968,  page  5  as  follows: 

A  computer  simulation  model  is  a  logical  mathematical 
representation  of  a  concept,  system  or  operation  programmed  for 
solution  on  a  computer. 
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be  used  to. obtain  e^erimental  results  directed  toward  a  variety 
/ 

of  ends.    For  example,  it  is  not  uncommon  to  have  students  do 

« 

acid-base  titi^ations,  equivalent  weight  determinations,  oxi'dation- 
reduction  titrations,  complexo metric  titrations,  analyses  of 
various  elemei)ts  using  titration,  and  mjethods  analysis  of  biological 
systems.    All  of  these  experiments  involve  a  basic  titratioii>tech- 
nic^e,  yet  the  student  might  spend  numerous  laboratory  periods 
{in  clifferent  courses)  doing  essentially  the  same  thing,  but  for  a 
-slightly  different  reason  each  time.    Similar  situations  exist  in 

I* 

teaching  many  different  experimental  techniques. 

Direct  labpratory  involvement  is  necessary  to  jicquire 

s 

manipulative  experience;  however,  conrlputet  methods  can  be 

'  ■  \ 

\ 

devised  to  provide  experience  in  computational  problem  solving, 
^decis  i  o  n -'T^iaking  and  in  the  manipulation  of  experimental  results. 
Furthermore,  one  of  the  characteristics  of  a  university  education 
is  tcJ  develop  the  ability  to  interpret  trends  and  concepts  to  bolster 

one.' s^  intuition  during  all  of  the  above  processes. 

i 

The  logistics  problems  associated  with  laboratory  instruc- 
»^  • 
t^on  often  impose  an  artificial  constraint  on  the  type  of  experiments 

t 

that  can  be  performed  by  the  student.    Laboratory  instruction  in 
^^any  chemistry  courses  has  not  kept  pace  with  the  theoretical 
concept^  introduced  in  the  corresponding  lecture  portions  of  the 
same  course  for  several  reasons.    First,  the  number  of  students 
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involved  las  increased  to  the  point  wl\ere  use  of  physical  facilities 
nriust  be  severely  restricted  to  accommodate  everyone;  schedules 
are- crowded  and  logistics  demand  that  experiments  must  be  per- 
formed which  require  no  more  than  three, to'four  hours  of  laboratory 
time--a  very  artificial  situation  in  the  real  world  of  chemistry* 
1^    Second,,  funds  are  not  generally  available  for  the  costly  equipment 
required  to  perform  experiffients  whilch  would  illustrate  the  more 
sophisticated  concepts  taught  in  the  lecture  portions  of  the'course. 
Third,  many  experiments,  although  simple  in  principle,  require 
several  periods  for  their  completion  and  canpot  be  performed  for 
lack  of  time  available  to  the  student.    The  lack  of  time  can  often 
be  traced  to  the  factors  described  aboVe  and/or  the  idiosyncracies 
of  the  student's  schedule.  -  / 


Experiments  Requiring  Complex  Equipment 


parly  in  their  careers  many  students  are  capable  ^f  handl- 
ing the  results  of  experiments  which  require  apparatus  too  co^mplex 
for  their  manipulative  expertise.    Indeed,  to  acquire  a  working  f 
k'Jiowledge       much  of  the  apparatu'5>'used  to  obtain  results  which 
are  commonly  quoted  in  beginning  science  courses  would  require 
more  time  than  is  available;  such  experiments  suffer  the  additional 
disadvantage  that  the  equipment  is  usually  available  only  for  ^ 


research  purposes. 


ERIC 


/V 


9-68 


Emission  spectroscopy  is  4  vdry  good  example  of  an  area 
from  which  modern  general  chemistry  courses  draw  considerable 
materiiil  but  a  representative  experiment  is  not  usually  included 

in  the  laboratory  portion  of  the  course.  ^  It  is  now  possible  to  simulat 

'  ^  \ 

a  spectroscopic  experime^nt  using^  computer  techniques  (2)  which 
can  be  pe^ormed  in  a  relatively  short  time.    With  such  a  program 
available,  early  in  his  career,,  the  student  is  required  to  underatand 
the  principles  on  which  a  spectrometer  works,  make  decisions 
concerning  the  data  collected,  as  well  as  the  manner  in  which  the 

)  ^ 

data  will  be  treated,  without  becoming  involved  in  the  (oftei? 

idiosyncratic)  vagaries  of  the  operation  of  a  spectrometer.  Pi:eJ- 

•  *& 

gumably,  the  student  will  be  exposed  to  the  latter  in  a  more  advanced 
laboratory  course  when  and  if  this  experience  is  important  to  his  ' 
profes  sional  dfeveloppnent ,    Many  classical  experiments  in  the 
Sciences  can  be  placed  in  this  category.  * 


/  Time-Compression  or  -Expansion  Experiments 

Cpmputer  techniques  are  now  being  used  to  simulate  experi- 
nients  which  take  either  unacceptably  long   or  very  short  periods 
of  time  to  perfo'rm.    For  example,  there  are  many  kinetics  experi- 


2*  A  course  in  spectroscopy  has  been  programmed  and 
demonstrated  at  the  Univei>sity  of  Texas  at  Austin  on  the  IBM 
1500  system  by  Lagowski  and  Castleberry  in  1968. 
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ments  that  require  relatively  simple  equipment  (which  is  accept- 
able frdm  a  financs^al^standpoint  for  large  numbers  of  students); 
howev^r^  these  experiments  requirie  periodic  sampling  for  10  to  24 
hours  which  is  unacceptable  ina  class.    Experiments  involving  , 
radioactive  substan^fed  also  fall  into  this  category;  in  thj-S  instance^ 
the  cost  of  equipment  as  well  as  the  time  available  to  do  the  experi- 
ments can  be  simulated  using  computer  techniques  and  performed 
on  a  more  convenient  time  scale,   "Thus^  computer  simulation  can 
be^u&edto  compress  the  time  required  for  an  experiment  that  might 
take  several  days,  or  expand  time  for  an  experiment  that  might 
take  only  a  fraction  of  a  second.  ,  A  good  example  b'f  this  type  of 
simulation  is  the^  Millikah  Oil  Drop  Exneriment*  In  either  case, 
the  experimental  data  dould  be  handled  by  the  undergraduate  student 
even  though  the  manipulation  of  the  apparatus  and/or  the  detailed 

planning  of  the  experiment  might  be  beyond  his  capabilities.  An 

4  ' 

additional  benefit  arises  from  the  abiUty  of  the  student  to  process 

raw  data  rapidly  when  he  attempts  to  fit  such  data  to  theoretical 
modert. 

Advantages  of  Computer  Simulated  Experiments 

Computer  techniques  judicially  used  in  conjunction  with 
live  experiments  can  lead  to  a  more  meaningful  laboratory. 
Actual  laboratory  time  can  be  gained  if  simulated  experiments 


are  use^  in  place  of  live  experiments  which  are  performed  repeatedly 
to  gather  data  on  different  systems  for  decision  making  purposes; 
the  time  saved  can  be  used  to  expose  students  to  experiments  whi<!:h 
normally  could  not  -be  performed.    Using  simulation,*  students 
can  be  exposed>o  relatively  sophisticated  experiments  early  in 
their  traliiiTig.    In  addition,  some  experiments  can  be  extended 
into  ar^as  that  would  be  impractical  in  a  real  laboratory,  because 

•    *  V  *  .     ^  » 

the  experiments  are  hazardous  and/or  the  equipment  too  expensive. 

Examplesof  this  include:  a  series  of  individual  simulated 
experiments  taken.by  students  at  the  University  of  Texas  served 
as  a  vehicle  for  demonstrating  the  unity  of  experimental  work 
directed^toward  a  given  chemistry  laboratory,  simulated 

experiments  involving  qualitative  analysis,  certain  qualitative 
techniques,  molecular  weight  determinations,  and  spectroscopy 
.were  organized  for  the  purpose  of  identifying  and  characterizing  . 
unknown  substances..    An,attemptS<^^o  this  in  actual  practice  would 
undoubtedly  involve  mere  time,  equipment,  and  space*than  would 
be  available  iji  a  normal  laboratory  course.    However,  the  equivalent 
in  computer  simulated  experiments  was  accomplished  in  several 
afternoons.    Indeed,  it  is  predicted  that  in  the  evolution  ol  these 
techniques  future  laboratory  courses  will  be  developed  which  incor- 
porate a  subtle  blending  of  real  laboratory  experiments  and  compxiter 
simulation  to  the  point  where  the  student*  will  at  times  feel  the  two 
are  inseperablel  ^ 
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Finally,  it  can  be  expected  that  considerable  savings  in' 
terms  of  spac^,  equipment,  and  expcjndable  supplies  w^Tuld  acc^rue 
from  laboratory  simulation  techniques;    Computer  simulation  can 
be  used  to  advantage  tq  alleviate  many  problems  associated  with 
laboratory  instruction  of  relatively  sophisticated  material  to 
p     large  numbers  of  students.  ^  - 

,  ^ « -    *  », 

H  '  Computed  Control  of  Experlrrtejits 

The  nature  of  experimental  work-irf'  modern  chemistry  is 
,    being  gret^tly  influe'nce'd  by  the  development  of  computpr^ystems  '  ^ 
that  can  intervene  directly  in  real  time  with. the  course  of  an 
J     *  experiment.    It  is  now  possible  through  cTomputers  to  collect  sev-4 

eral  kinds  of  data  virtually  simultaneously,  process  the  data  in  a 

•  ^  '        ^  '      .    ,  \^ 

standard  way,  make  a  judgment  (either  by  the  experimenter  or  the 

'  computer)  of  the  results,  and  intervene  in  the  experiment  while  it 

is  still  in  progress.    Such  applications  now  are  cotnmon  in  industry, 

but. the  undergraduate  junior  or  senior  in  the  average  advanced 

laboratory  Qourse  knows  virtually  nothing  of  such  computerized 

data  collection  technique's^*    The  development  of  cotnputer  controlled 

(or  monitored)  experiments  for  use  in  advanced  undergraduate 

laboratories  xis  but  one  of  the  newer  capabilities  urgently  needed- 

JSaVeral  years  of  experience  abtained  in  the  application  of 

comp-uter  based  Techniques  in  general  chemistry  and  in  organic 

ERIC  , 
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chemistry  strongly  suggest  that  these  methods  could  be  sue-- 
cessfuUy  applied  to  every  level  of  instruction  in  technical  and 
vocational  courses.    A  description  of  the  individual  modules 
developed  for  introductory  chemistry  courses  by  Computer-Based 

0 

Instructional  Systems,  Incorporateci,is  shown  in  Appendix  A.  A 
discussion  of  a  typical  example  of  the  use  of  simulation  techniques 
will  now,  follow* 

PHOMA    {See  Appendix  B) 

This  rnodule  is  a  simulated  experiment  in  th(e  use  of  spectro- 
photometers for  quantitative  analysis.    The  procedure  is  based  upon^ 
that  describe^  in  ASTM,  Part  32,  pp  78-80,  May,  1965*  Essentially 
the  experiment  requires  that  the  student  demonstrate  he  is  familiar 
wilh  the  outlined  proce'dure  and  then  collect  and  analyze  data  as  if 
he  were  in  a  real  laboratory  environment.    The  students  *  ^results  . 
are  then  examined  for  accuracy  and  recorded  through  the  use  of 
a  sister  program,  PERlvtA.    It  will  be  necessary  to  assign  each* 
student*  an  ^'unknown"  number.    The  "actual"  value  of  the  unknown 
sample  will  be  based  upon  the  number  assigned  as  follows: 


Any  number  less  than: 

%Mn 

.  1000 

0.017 

2000 

0.41 

3000 

0.35 

4000 

0.49 

5000 

0.62 

6000 

0.  75 

•  .     ,  7000 

1.25 

8000 

1.45 

9000  . 

0.82 

A  rarldom  experimental  erroV  is  generated  to  avoid  total  duplica- 
Sion  of* results. 

A  typical  student  interactioji  follows.    (Student  input  is 
•  I  / 


J) 
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capitalized  and  underlined^fbr  clarity.) 


Your  friendly  CBIS  chemistry  tutor  bids  you  welcome* 
Please  enter  your  student  number  ?  2  ^ 
Thank  you,  Mark 

Which  of  the  pro,grams^|jJ^d  in  your  index  would  you  like  to  take 
today?    Type  in  the  name  of  your  'choice?  PHOMA 
The  Spectrophotometric  Determination  of  Mn  in  Steel 

I 

Have  you  read  the  experimental  procedure?    YES  | 
Good.  .  •whaft  is  the  maximum  percent  of  Mn  that  is  suitable  for 
this  e:£periment?  5^ 

O.K.,  swe\l.»,at  what  wavelength  in  millimicrons  will  measurements 
be  taken  using  the  spectrophotometer?  545 

Great...  we  are  on  the  same  wavelength.  what  is  the  substance 
that  imparts  color  to  the  solution  that  is  analyzed  spcctrophoto- 
metrically?    PERMANGANATE  ION 

Mighty  fine.  •  #1  really  believe  that  you  are  ready  to^perform  this 
experiment.  .  . 

Now  pleaae,  enter  your  assigned  experiment  number?  >  1500 

What  weight  (ih  grams)  of  99.9  pencent  pure  Mn  do  you  wish  to 
dissolve  In  hot  HNO3  in^preparation  of  the'standard?  2,  0 

 you  are  out  of  the  optimum  range  for  preparing  the  standard 

froni  pure  Mn  .  •  Please  .examine  the  experimental  details  and  '  ^ 

andwer  again.  .  .  .    ,  '  . 

What  weight  (in  grams)  of  99.9  percent  pure  Mn  do  you  wish  to 

dissolve  in  hot  KINO3  in  preparation  M  the  standard?    >  5 

Fine.  ♦  ♦the  sample  has  been  dissolved  in  a  minimum  of  hot  HNO3 

and  allowed  to  cool  to  room  temperature.    To  what  volume  (in  ml) 

do  you  wi3h  to  dilute  this  solution?  100 

The  concentration  of  solution  *A'  is  now  5  mg  Mh/l.OO  ml. 

What  volume  (in  ml)  of  solution  'A*  do  you  wish  to  dilute?  1 
To  what  volume  (in  ml)  do  you  wish  this  aliquot  diluted?  150 


/ 


****#Your  standard  solution  contaihs  0.  0333  mg  Mn/l.OO  ml. 

You  might  want  to  check  this  calculation  at  the  conclusion  of 
the  experiment  to  verify  my  results  v 

Very  good.  .  ,now  J  will  take  whatever  volume  of  this  standard 
you  would  like,  add  KI03  ,  dilute  to  50  mU  heat  for  30  minutes  to* 
develop  the  color  and  allow  to  cool  to  room  temperature.  \ 

Now  please  enter  separately  5  volumes  of  the  standard  ^ution  to 
be  so  treated.  . • 

Calibration  solution  volumes: 
#1.  ?  5 

#2.?  10  A 
#3.  ?  11/^  / 

#4.  ?  25^  '[         .  \  ^ 

#5.  ?  30  ^ 

Thank  you.  .  .the  solutions  ha^(:«(rbeen  treated  to  develop  the  color  an< 
analyzed^  spectrophotometrically.    The  following  absbrbapces  have 
been  ^bta  ined  .* .  • 

Solution  Absorbance  ^ 

i^iffi^:/^  ^  1   0.  21 

3«^3«c3«c3^:^     2   0. 33  ♦  * 

3^:(c3«c:^:^  #  3   Q.  47 

^  4   0.  71 

^  5  n--0. 87 

***:tc5ic ji^g  cell  correction  for  absorbance  is  found  to  be  0.03 

O.K.  >  let  us  now  gather  data  about  the  unknown  sample*./* 
«****vveight  of  steel  sample     beaker  s  3.  0321  grams 
«««**;<wv^,ight  of  beaker  «  2,5916  gr^ms 

I  have  now  prejiared  the  unkno^vn  sample  for  analysis  and  diluted  it 
to  100.00  ml  as  is  described  in  the  experimental  procedure. 

 what  volume  (in  ml)  of  this^solution  dZ  you  wish  to  use  as  an 

aliquot   ^ 

*****sample  aliquot? 

To  what  volume  {in  ml)  do  you  want  to  dilute  this  aliquot. .  . 
.vgample  dilution?  5£ 
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The  diluted  unknown  sample  solution  has  been  analyzed  with  a 
spectrophotometer.  •  • 

*****  the  absorbance  of  the  sample  solution  is  '0»29- 

*****  the  cell  correction  for  absorbance  was  found  to  be  0,  01 

Determine  the  percent  of  Mn  in  your  sample.  "When  this  is  completed, 
login  with  program  *PERMA*.    I  thank  ye^kindly. 

The  students*  experimental  results  are  then  checked  by 

the  following  program. 


■  PERIvCA 

r 

In  the  program,  the  student  enters: 

1.  His  assigned  experiment  number 

2.  Plot  or  ratio  determination  « 
3a.  Mg  of  Mh  from  the  curve,  or, 

3b.  Corrected  absorbance  of  the  Unknown  sample, 

•and  if  3b, , 
3c.  Ratio:   mg  Mn::absorbancy 
*  4.  Sample  aliquot  ^ 

5.  Sa^mple  dilution 

6.  Sample  w-eigrft 

A  check  of  the  students'  calculated  percent  and  actual  percent  Jls 
then  performed  and  the  results  output  to  the  student.    This  includes 
a  theoretical  "grade"  bas^d  upon: 

Difference:  CaPc  &  True  %  y  .  Grade 


I 


<  0.021  '  A 

<  0.051  r  B 

<  O.U  p 

<  0.15  D 
>  0.15  .  F 


) 


sample  student  interaction  foUow\s. 

Your  friendly  CBIS  chemistry  tutor  bids  you  welcome. 

Please  enter  your  sfutient  number?  _2 
Thank  you',  Mark 

W**ich  of  the  programs  listed  in  your  index  would  you  like  to  take 
today?    Type  in  the  name  of  your  choice  ?  PERMA 
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Welcome  back,  • .  ^ 
Enter  your  assigned  experiment  numbef  ?  1500 

Did  you  determine  the  percent  of^Mn  by: 

1.  plotting  absorbance  vs  mg  Mn/50  ml 

2.  ratio  of  mg  of  I^n  to  absorbancy 

Please  answer  by  number.  ^  \ 

Complete  the  following: 

*****  Corrected  absorbance  of  sample  z  ?  *  28 

*****  Ralio:   mg  of  Mn::absorbance  «  ?  1 , 20 

*****  Sample  aliquot  (ml)  «  ?  JIO  ^  * 

*****  Sample  dilution  (ml)  s  ?  50  ,/ 

*****  Sample  weight  (grams)  m  ?  .  4405 

Well  now,  I  calculate  your  percentage  as  0.381  and  the  actual 
percentage  .of  the  unknown  is  0.41.    Not  bad,  not  ba.d  at  all,  old 
cold  hands,  for  if  Pm  scoring  your  results,  your  grade  m  B 

Thanks  for  being  here,  partner.  .  . 


'  It  should  be  emphasized  that  although  this  paper  may  appear 
to  be  cast  in  a  static  context,  the  state  of. the  art  is  rapidly  changing  " 
because  innovations  in  hardware  and  software  are  characteristic 
of  computer  technology.    Experience  in  using  computer-based  * 
instructional  modules  is  being  gai^^**^  continually  with  larger  and  larger  ^ 
groups.    The  amplification  and  daily  acceptance  of  these  innovations 
on  a  broad  spectrum  of  subjects  is  a  goal  that  is  within  sight  and 
will  become  a  reality  in  time.    How  rapidly  this  will  be  accomplished, 
one  cannot  say  for  change  in  education  is  a  slow  evolutionary 
process  that  is  tempered  with  caution  and  conservatism  to  -  ' 

extreme  degree.    The  final  decision  for  acceptance  will  rest  with  the 

tor  be  it  good,  bad  or  indifferent! 

J  ,  510 

o 
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APPENDIX  A 

Module  Area  Mode  ^ 


1. 

METl 

The  Metric  System 

Drill ' 

2. 

MET2 

The  Metric  System 

Applied  Drill 

3. 

TEMCO 

Temperature  Conversions 

Drill 

4. 

DECAL  ^ 

<• 

Density  Calculations 

Applied  Drill 

PETEL 

Percentage  Calculations 

Applied  Drill 

L 

\j  • 

GASl 

TKe  ,Gas  Laws 

Simulation 

7. 

GAS2  , 

The  Gbs  Laws 

Applied  Drill 

8. 

EQBAL 

•Balancing  Equations 

Drill 

9. 

FORNfl 

Formula  Writing 

Drill^ 

10. 

NOBIN 

Inorganic  Nomenclature 

Applied  Drill 

lU 

NOTER 

Inorganic  Nomenclature 

Applied  Drill 

12. 

NOANE 

«    Organic  Nomenclature* 

Tutorial  Drill 

13. 

QUA^E 

Organic  Nomenclature 

Examination 

14. 

NO  GEN 

Organic  Nomenclature 

Applied  Drill 

15. 

COPRO 

CoUigative  Properties 

Simulatiojn 

16. 

PHOMA 

Instr  u  menta  1^  n3  ly  0  is 

Simulation 

17. 

"PERMA 

Data  Check  oCpHOMA 

18. 

BLEAP. 

Instrumental  Analysis  ^ 

Simulation 

19. 

PERCH 

Data  Check  of  BLEAP 

20. 

gr'ave 

Quantitative  Analysis 

Applied  Drill 
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APPENDIX  C 

•  t 

A  Partial  List  of  Institutions  Utilizing  Laboratory 
p  Simuj^ation  Techniques' 

/  Institution .  % 


University  of  California  at  Santa  Barbara 
Smith  College 

The  University  of  Texas  at  Austin 

Carleton  College 

University  of  Oregon 

University  of  Pittsburg 

Washington  State  University 
/ 

College  of  St.  Elizabeth 
Xavier  University 

N.  Carolina  Educational  Computing  Service 
Fairmont  State  College 
Florida  State  University 
University  of  Illinois 


Developer(s) 
Ewig,  Gerig,  and  Harris 
Fleck 

Castleberry,  Culp,  Lagowski 
Child,  Finholt,  Ramette 
Klopfenstein,  Wilkins 
Johnson 


Willett,  Breneman,  Kissler 
Sister  Callavan 


Corrington,  Marshal 

Denk 

Swig^r 

Mellon,  Dence,  Graham 


Smith/fi:??,  - 


erJc 


\ 
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USES  OP  CC»IPUTERS  IN  COMPENSATORY  COLI£GE-LEVEL  SCIENCE  EDUCATION 

>  • 

by 


Ralph  M.  Deal  - 
Chemistry^  Deparfepent 
Kalajoasiop  College 
Kala&aizoo,  Mlcjiigan 

r- 

•  INTRODUCTION t    COMPENSATORY  EDUCATION  AS  A  CURRENT  HIGH  PRIORITY 
NEED  IN  HIGHER  EDUCATION.  * 

/       Host  cumnt  interactive^  uses  of'  coMpcrters  in  prograxmed  * 
learning  are  too  expensive  to  use  on  a  scale  larger  than  research 
or  developmental*    One  area  where  the  use  of  a  comput«it  terminal 
by  a  student  or  a  saall  group  of  students  to  develop  specific 
concepts  and  sklllB  is  econ^cally  feasible  is  conpensatory 
education*    I  an  referring  specifically  to  nost  institutions  of 
hii^r  learning  in  the  United  States  bringing  black  students  and 
spanish-speaking  students  into  th^ir  freshnan  class  although  these 
students  do  not  me^t  the  usual  entrance  criteria* 

For  the  most  part,  these  Institutions  have  felt  that  they 
have  done  their  part  to  combat  institutional  raci^  by  confronting 
their  usual  vhite  students  with  the  presence  of  black  peers* 
Unfortunately,  these  new  students  are'  not  academic  peeM*  They 
come  from  educational  backgrounds^ in  which  proper  prepay  for 
survival  in  :^he  university  or  college  is  not  possibleC  These 
students  then  do  poorly  in  their  classes,  reinforcing  the  biases 
of  white  students  and  black  students.    "^Liberal"  professors  who 
give  underprepared  st\*dents  higher  grades  than  they  deserve  are 
"compensating*"  in  a  way 'damaging  to  those  students*  Increasingly, 
colleges  and  universities  have  come  to  realize  ^the  fsdlxire  of  their 
cumnt  programs  to  meet  the  educational  needs  of  underprepared 
stiJdents  and  axe  preparing  to  either  (l)  give  up  accepting  students 
who  do  not  meejb  the  usual  academic  requireaents  (especially,  entrance 
tests  such  as  SAT)  or  to  (2)  coamit  a  aignificantly  larger  share 
of  their  institution's  resources  to  preparing  the  new,  illequipped 
student  to  compete  favorably  in  regular  classes.    Since  this  is 
a  new  task,  no  one  is  stire  how  best  to  provide  the  required  "com- 
pensatory education"*    Certainly  one  method  worth  exploring 
extensively  is  the  use  of  highly  individualized  learning  programs  i 
with  some  way  of  pushing  each  student  to  use  all  his  ri^sources  to 
solve  problems  in  a  wide  variety  of  situations  and,  fit  'the  same 
time,  supporting  and  encouraging  the  student  with  a  high  success 
rate.    This  is  probably  best  done  with  a  combination  of  proctorlng 
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}ay  MnsitiT*,  fcnowltdgMblt,  self-oonfldent  pMrs  azyl  a  Mrles  of 
progrmnad  unite  IdMlly  AdMinisteMd  through  a  cosputar.  Tba 
coixrsa  atroctura  davalopad      Frad  Kallar  and  appUad  to  aazqr 
dlaciplinaa  (inoluding  pfayalba  at  the  Sducation  Raaaaxch  Cantar 
at  KIT  and  at  Portland  State  Univaralty)  aaaaa  appropriate. 

ADVANTAGES  OP  A  COKPOTER  IH  COMPENSATORY  EDUCATION. 

What  ia  required  ia  a  way  to  achieve  the  V]ery  difficult  taak 
of  taking  a  atulent  Imck  orer  akilla  be  haa  been  expoaed  to  and 
feela  be  knowa,  abow  bin  where  be  needa  to  rework  hia  knowledge 
and  akilla  and  let  bin  kncH  when  he  has  aaatered  than,  all  in  a 
few  Mentha  during  which  hia  auburb-txained  peers  are  progressing 
throuj^  courses  he  would  find  very  difficult  to  paaa. 

The  coaputer  has  sereral  advantages  over  progfaaaed  textual 
Materials  beret    (l)   The  student  cannot  c^t  (hiaself),  (2)  the 
^tiae  taken  for  each  frame  can  be  used  to  pufch  the  attident  to  work 
faster,  (3)  detriled  records  of  student  performance  can  be  used 
to  ikprove  the  prograa  through  continuous  modifications.    From  ay 
experience,  well^written  prograaaed  naterUla  fre  frequently  not 
used  effectively  by  students  who  are  stsapicious  of  the  nit-picking 
habits  of  acarleala  and  accept  too  easily  their  own  anawer  as 
equivalent  to  the  correct  response  given  in  the  tart«    Of  course, 
the  coaputer  is,  at  this  stage  in  the  development  of  CAI^  not 
going  to  recognise  an  original  valid  reaponae  the  firat  time  it  ia 
encountered,  any  more  than  a  programmed  text.    However,  with  a 
auff iciently  flexible  learning  ayataa  and  a  dedicated  prograaming 
ataff  (programming  in  both  computer  and  educettional  aenae),  a 
personal  reaponae  can  be  made  to  the  student  giving  original 
inaightful  reaponses  and  the  computer  programs  can  avoid  the 
rejection  of  subsequent  aiillar  responses, 

HISTORY  OF  A  COMPUTER  AUGHEltoa)  CONPENSATORY  COURSE. 

In  the  spring  ani  summer  quarters  of  I969-I97O  at  the  Chicago  ^ 
Circle  Campus  of  the  University  of  Illinois,  I  ran,  with  Barry 
Clenson  of  the  Eiiucationsl  Assistance  Program  of  the  Circle 
Campus,  an  experimental'  chemistry  course  entitled,*  "Problem 
Solving  in  Chemistry. "  5ased  on  the  analysis  by  Robert  I.  Walter 
of  the  Chemistry  Department  of  the  University      Illinois  at 
Chicago  Circle  of  pretests  of  chemical  knowledge  and  mathematical 
skills  and  subsequent  performance  in  introductory  cheaistry  Qourses, 
we  decided  to  einphasize  math  skills  rather  than  chemical  concepts 
in  our  com|a;&atory  course «    Throughout  the  course,  the  students 
used  compi^^ programs  in  groups  and  IzKiividually  on  a  2741 
terminal  *^,360/5O»    All  of  the  programs  were  written  in  CPS,  a 
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conversational  'subeet  of  "PL/if  thm  only  oonversatlonal  terminal 
langiase  available  in  the  Chicago  Circle  terainal  networic  (other 
timn  a  fora  of  BASIC).    Many  of  the  protpcam  t»ed  were  developed  . 
with  iaiividnal  attxiente  in  tutorial  aituationa   in  the  fall  and 
winder  of  1969-70.         interpretive  nature  of  GPS  waa  taken 
advantage  of  in  the  operation  of  several  programB  as  well  as  in 
the  construction  of  all.  of  the  progxaas.    The  parograaa  are 
described  briefly  in  the  appendix. 

In  the  fall  of  19?0,  I  ran  a  q^jailar  course  at  Kalaaazoo 
College,  using  a  teletype  terminal  tOva  PDP-10  at  Western 
Michigan  Itolversity,  using  prinpipally  BASIC,    I  was  not  able  to 
develop  as  wide  a  range  of  programs  at  Kalamasoo  College  as  I 
had  used  at  the  Cirole  Campus  but  I  introduced  a  learning 
sequence  in  BASIC  itself  which  seems  prtudsin^  in  terms  of 
student  attitudes  in  this  course, ^ 

PROCTORING  IN  COMPENSATORY  EDUCATION,  ' 

Since,  in  rsy  experience,  students  in  compensatory  education 
programs  need  an  extraordiniuay  amount  of  support  from  a  sympathetic 
proctor,  no  computer  learning  system  should  be  expected  to  work 
by  itself  with  a  single  student.    The  Keller  system  provides  one 
proctor  for  10  students.    No  more  than  10  students  per  proctor 
during  each  session  seems  to  be  important, 

Sttxients  work  well  Xn  groups  of  roughly  equivaleni^  ability 
and  provide  much  of  the  necessary  support  for  each  other*  This 
mode  of  operation  is  most  effective  when  a  proctor  can  fc*<i".ently 
act  as  a  group  observer  and  make  occasional  comments  on  the  dynamics 
of  the  group,    Etch  member  of  the  group  is  charged  with  the 
responsibility  of  being  sure  that  the  other  members  of  the  group 
are  fully  >participating.    This  behavior  can  be  reinforced  by 
requiring  all  the  individual  members  of  the  group  to  pass  an 
individual  mastery  test  before  the  group  can  go  on  to  the  next 
program  or  tbpic, 

LANGUAGE  COHSIDBRA^^IOlfS, 

Whether  CAI  or  other  learning  materials  and  media  are  used 
in  such  a  course,  they  will  2;equire  significant  modification  to 
serve  underprepared  students  Veil,    Since  new  materials  must  be 
developed  from  day  to  day,  a  flexible  preprocessor  of  a  higher 
level  language  is  very  desirable  as  is  an  interpretive  language, 
A  user's  group  of  educators  using  computers  in  compensatory 
education  ahou^  be  formed  to  share  the  experience  and  programs 
of  such  developmental  programs.    Eventually  we  should  have  a  shared 
set  of  programs  from  which  a  program  can  be  selected  to  fit  the  peril 
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standazdJjl 


auUr  UMda  of  a  partlottlar  stndant.    Of  coutm 
languac*        m  FUNH  wovOjd  b«  Boat  oaaful  alaoa  it  would  allow 
a  ptoezsM  daralopad  at  oat  insUtutl<in  to  te  usad  by  anethar.^ 
^  ^PLAMTT*  COUBSEVRITBR  anl  otfau:  CAX  iWaea  jttT*  yogra— lig  tiM 
bxt  Halt  tha  fora  of  tha  atttdant-cop^ot^  InWractloa,    I  hara 
f otmd  BASIC  and  AID  on  tha  FDP-10  to  ba  ai&a«t4ta  for  alapla 
laaznlng  prograaa  at  Kalaaaaoo  OollMa.   Tba  adfaataga  of  uains 
bonratttioaal  l«n«w«i>a  ia  thair  ba^l^availabla  on  aiall  coaptttara 
and  raquiring  onlgr  alniaal  coxa. on  a  laxga  eoaputar.  Tfaazafoza» 
thty  oaa  ba  aaad  by  amll  InaUtutiona  who  bava  ttelr  own  aaall 
tiaa-ataaxiss  ayataa  or  axa  porebaaing  tiaa  on  a  tlaa/ataazlBs 
natwazi(  whloh  will  not  aaintaln  a  larga  GAI  langots*  moh  aa 
FLAIfXT  or  C0UR3BIRIXIR  on  tha  pablio  libcary*  \ 

In  th»  wintar  quartar  of  1971  I  woadsad  with  a  Yaatly 
laprorod  raraion  of  PIL  (Pittabargh  Intarpratira  I«nguaga)  at 

.  tha  UniToraity  of  Hiohlgan.    Jaaaa  Gonkllh^at  tht  Oaater  for 
Raaaaxoh  on  Taamlng  &  Taaohing  in  Ann  Arbor*  Miohigan,  haa 
axpandad  PIL  to  includ*  ttaaful  ooaaands  for  atring-aanipulation 
and  tbB  capability  of  oalliJig  FQBTRAN  or  aaseably  ocmpilad  8ub> 
■routines.    I  find  that  this  co«biaation  of  a  powerful  intarprater 
dealing  with  atudanta  dirictly  with  auch  6f  tha  work  being  done 
in  ooapiled  prograaa  to  be  flexible  and  efficient  and  to  allow 

.  rapid  prograa  aodifications, 

GCWCLUSION.  *  , 

If  universities  and  colleges  are.  serious  about  coapenaatory 
education,  they  aust  aalce  *  considerable  investaent  in  spedlal 
prograaa  deaigned  to  equip  undarprep(a?ed  students  to  coape;te. favor- 
ably in  regular  couraes,   Thia  investaent  oan  be  in.peraonnel 
and/or  hardware.    Hardware  alone  ia' not  .enough,  but  coaputara  can 
be  an  effective  part  of  auoh  an  effort^ 
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AFFENDIX)   BrUf  dMoription  of  computer  pirogxmas  written  for 
«M  in  oa«pwu4tor7  coutms  at  Chicago  Circle  Caapus  of  tha 
IMvaraity  of  TlUnoia  and  at  Kalaaasoo  College. 


PROGBAir  LAN3UAGB 
GAl/aPS        GPS  ' 


Line 
Len^h 


GPS 


Slide  Rule  CPS 


BRIEF  DESCRIPTION 

A  flMlblt  CAI  prootssor  with  Bost  of  tha 
fMturts  of  COURSEVRITER  and  qu^to  a  ftv 
Boro  foatuTMe    Thara  axo  two  laalc  modts 
of  aporatlon  •  author'  and  atudant*  Tho 
ixAhar  mode  is  daalgntd  for  toaohers 
with  no  podor  coaputtr  ozparlonco«  The 
aaln  adymntago  of  thla  yartioular 
ajTBtam  is  tha  flaxibUity  in  tbt 
of  pattams  it  can  raoo^dsa  in  tha 
otuiant  raaponaaa.    Tha  struotura  is 
capitbla  of  sona  ayntactidal  analorsia^ 
The  student  intaraction  is  unfortunataly 
Buch  too  slov  to  ba  an  affaotivA  fora 
of  C3AI.  ' 

A  Tary  siapla  porogzaa  that  porints  out  a 
pair  of  **s  spaead  at  random  intarvals 
acToas  tha  paga«    Tha  atu&ant  is  askad  to 
aatiaata  diatahca  batvaan  tha  in 
savaral  diffarant  langth  units «  Thm 
.atudant  is  giran  tha  ratio  of  his  ^ 
answar  to  tha  corract  length  in  percentage, 
ilhan  the  sttadlant  uses  this  infoisation 
ha  is  of  coqcsa  doing  drill  and  practice 
in  percent  as  well  as  line  length  estima* 
tion»  1 

A  generative  drill  and  practice  routine 
designed  to  improve  the^speed  Mxd  acoitracy 
of  the  student's  use  of\  slide  zuLe^ 
Several  parameters  nay  be  vas^ed  by  the 
stulent  or  proctor  to  increase  the  diff i-> 
culty  of  tha  task*    The  computer  counter- 
responds  vith  "decimal  point  error", 
"decimal  point  OK",  "close",  "very  close", 
"GREAT!",  depending  pn  tlje  student  re- 
sponse«    The  time  taken  from  the  presenta- 
tion of  the  problem  to  the  correct  stu- 
dent response  is  printed  out  in  an 
attempt  to  eitoourage  faster  slide  rule 
calcuIatioxui« 
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PROGRAM  LANGIUSE 
T«ft  a  Drill  CPS 


Sxabol 

Muilpula- 

tion 


CPS 
AID 


Vodc 
poroblmui 


CPS 


Dr»gst«r  GPS 
SUulAtloa 


Drags tor  -  CPS 
Simple  Kodel  BASIC 


BRIEP  DESCRIPTION 

The  studmt  s^cts  tho  kind  of  prola^aa  . 
i%.S.f  first  dagTM  •quatlons,  yn^gatlTt 
ainibcrs)^    A  parttMt  of  gennatod  pco* 
blMS^  tKo  of  eftcb  of  3  to  5  typeci  in  each 
pmbln  Mt«  l8  givon  and  soozad,  Pxobloa 
tjpes  aitaod  art  than  prtaaniad  until  5  art 
correotly  answarod  in  a  ron,   Tbtn  a  poat« 
taot  ia  uatd  to  aaaaura  any  iapxrvaatnt 
in  studant  ptrformnoa,   A  pirintoiit  of 
th»  intazaetion  ia  proridod  for  l)oth 
•tttdont  and  instructor. 

A2ga1>zaic  aquations  are  prasentad  with 
tbo  rtquast  to  raarzango  the  aquation  to 
solvo  for  one  particular  Tariable;  A 
variety  of  symbols  say  be  used^   If  thi 
atudent.  fails  to  pcrorida  a  valid  rauy 
rugenent^  a  cozxact  rearzangeiient  is 
presented. 

This  is  an  attaapt  to  produce  an  econco^* 
icai*^  GAI  eystea  within  a  very  liaited 
context  of  word  probleas  requiring 
nuaerical  answers  with  wits.    The  pro- 
bleas theaselves  toe  in  a  sepezate 
booklet.    The  coaputer  only  accepts  and 
evaluataa  student  answers.    The  format 
for  the\  units  is  quite  ^  limited. 

A  digital  implementation  of  a  hybrid  com* 
puter  program  written  by  SAI,  Thxee 
coupled  seoond^order  differential  equations 
are  involved.    Program  is  both  ele^f  ahd 
inefficient     the  hybzrid  obviously  does 
this  job  better, 

A  very  simple  mathematical  model  of  the 
finish  time  of  quarter  mile  drag  race  as 
a  function  of  7.  variable  parameters  (suph 
as  wheel  base).    Possible  outcomes  are 

SLIP,  FLIP,  PmSHED  IN  13,7  SECONDS  J)X 
146  MILBS  PSR  HOUR,    This  program  was 
used  to  colore  functions  of  one  and  two 
variable*. 
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ARMCHAIR  UNKNOWNS 


Clifford       Venler  and  Manfred  G.  Relnecke 
Department  of'  Chemistry 
Texas  Christian  University 
Port  Worth,  Texas  76129 


We  would  like  to  report  our  experience  with  a 
computer,  simulated  organic  qualitative  analysis 
course  as  a  teacKing  aid  in  undergraduate  organic 
chemistry  over  the  past  two  years.     Previously,'  ^mith 
(1)  has  outlined'  a  dialogue  for^mat  for  this  type  of 
course  between  student  and  the  University  of  Illinois 
PLATO  system,  but  not  \ll  of  us  have  PLATO  as  a  helper 
Gasser  and  Emmons(2)  described  a  system  for  using  the 
computer  to  test  results  from  the  laboratory  which  is* 
similar  to  computer  simulated  unknowns,  but  emphasizes 
a  different  aspect  of  computer  us^,  namely  data  sor- 
ting.   Our  own  system  derives  from  that  describe^  b^ 
Swets  and  Peurzeig(3)  many  years  ago,  which  uses  a 
computer  to  teach  , medical  diagnosis,  a  subject  very 
similar  in  nature  to  organic  qualitative  analysis. 
The  underlying  idea  of  any  such  computer  exercises  is' 
that  a  real  patient,  need  noi  die  in  order  to  allow  a 
novice  to  learn  through,  correction  of  his  mistaken 
inferences.     In  an  organic  Qualitative  analysis 
simulation, . the  student  is  hot  burdened  with  short- 
comings in  his  conception  or  the  logical  framework 
of  analysis  and,  at  the  same  time,  misinformation 
gleaned  fr9m  his  incompletely  developed  techniques. 
Hopefully,  in  his  first  semester  laboratory,  the 
student  has  gained  a  modicum,  of  confidence  in  his 
basic  experimental  skill.    Our'  goal  is  to  simulate 
the  lab  experiemie  which  the  student  will  encounter  , 
as  closely  as  one  can  on  CAI  CRT's  with  a  Course- 
writer  program  on  an  IBM  1500  computer. 

Being  inexperienced  in  the  use  and  programming, 
of  computer-assisted  instrui^tion,  we  have  also  attemp- 
ted to' keep  things  as  simple  as  possible,  consistent 
with  our  goals.    For  this  reason,  we  decided  to  tie 
our  course  to  a  single  text,  R.  L.  Shriner,  R.  C. 
Puson,  and  D.  Y.  Curtin,  "The  Systematic  Identifica- 
tion of  Organic  Compounds,"  Fifth  Edition,  Wiley,  New 
York,  196^^(SPC).    We  chose  this  text  as  the  most 
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popular  and  widely  used  of  its  geijre. 

Coded  responses  are  far  more  easily  programmed 
than  ti}e  keyword  recognition  dialogue  described  by 
Smith (1),  yet  allow  immediately  a  far  more  extensive 
set  of  experiments  and  unknowns  than  the  program 
described  by  Gasser  and  Emmons(2),    Shriner,  Puson, 
and  Curtin,  as  a  text,  is  easily  coded.    We  have  ' 
done  so  as  follows: 

.1.    Compounds (unknowns)  are  given  tags,  mm-nnn-x 
where  mm  is  the  table  number  of  the  table  in 
which  the  compound  appears  in  the^  Tables  of 
Derivatives  on  pages  307-382  of  SPt;,  mm  is 
the  boiling  point,  melting  point,  or  decom- 
position point  of  the  compound,  whichever  is 
listed  first  in  Table  mm,  and  x  is  an  index 
in  case  of  multiple  listings  under  mm-nnn,^ 
simply  denoting  that  the  compound  is  the  xth 
one  with  mm-nnn.    For  example,  2"|i|-dimethyl- 
benzenesulfonyl  chloride  is  the  second  com*- 
•  pound  with  melting  point  3^*^C.  listed  in 
Table  l\5.    Its  code  is  ll5-3^"2.  Methyl 
benzyl  ketone*  is  the  first,  and  only,  com- 
pound in  Table  38  with  a  melting  point  of 
270c.    Its  code  is  38-27-1.    Its  boiling 
point  of  216OC.  is  listed  after  the  melting 
point  and  may  not  be  use*d  in  the  code. 

2.    Chemical  tests  are  divided  into  three  cat- 
egories. 

a.  Preliminary  tests  are  individually  coded, 
ig  -  ignition  test 
mp  -  melting  point 
bp  -  boiling  point 
sg  -  specific  gravity 
nd  -  refractive^ index 

ad  -  optical  roMtlon      ^  ' 


si  -  solubility 
b.  All  .classification  tests  are  coded  c-yy 
where  yy  is  the  number  of  the  experiment  * 
outlined  on  pages  112  to  17^  .of  SPC.  For 
y  example,  the  Tollen^s  test  i3  c-29,  the 
chromic'  anhydride  test  for  primary  and 
secondary  alcohols  is  c-8v 


If 
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c*    All  derivative  tests  are  d-iz,  where  zz 
is  the  procedure  number  from  pages  229 
to  304  of  SFC.    For  exampl^),  a  2,4-DNP 
is  d-l8,  and  an  amide  of  a  carboxylic 
acid  is  d-2. 

The  program  itself  is  presented  to  the  students 
entirely  in  the  first  two  weeks  of' the  second  semes- 
ter.   This  is  imperative  since  they  must  have  confi- 
dence in  their  logical  approach  to  their  l^iboratory 
unknowns  as  soon  as  possible.    This  procedure  has  the 
added  advantage  that  other  CRT  users  usually  don*t 
need  computer  time  during  the  first*  two.  weeks  of  their 
courses,  allowing  one  to  have  twenty  to  thirty  hours 
of  time  each  of  the  first  two/ weeks.     Each  student  is 
first  instruct*ed  and  drilled  ^in  the  use  of  the  key- 
board and  light  pen,  then  taught  the  codes  and  how  to 
us*e  tl^em.    As  a  preface  to^  his  computer  unknowns,  he 
is  given  a  trial  problem  and  assured  that  no  score  is 
kept  on  his  progress  through  it.     In  fact,  he  is  en- 
couraged to  play  with  the  system  a  little  to  see  what 
it  will  and  will  not  do.    At  this  point,  the  student 
will  (either  .flounder  or  start  to  use  his  book.  He 
must  read  and  understand  the  introductory  'material  in 
the  book  before  he  can  make  very  much  progress  on 
this  trial  problem.    After  completion  ot  the  trial 
problem^  he^ is  assigned  three  unknowns  to  identify. 
So  far,  we  haye  given  each  student  a  carboryl  compound 
another  neutral  compound,  and  an  icid  *or  a  base. 

The  CRT's  are  made  available,  usually  in  the 
evening  or  on  weekends,  on^a  fir^-come,  first-served 
basis..   The  student  may  leave  the  computer  at  any  time 
and  return  later  without  penalty.    We  ^^ncourage  the 
student  to  sign  off  -4^  he  takes  more  than  a  few  min- 
utes between  tests.    Typically,  a  student  will  get 
five  minutes'  worth  of  data  from  the  computer,  then 
»  leave,  returning  later  after  interpreting  the  facts 
with  the  aid  of  the  lab  text,  do  one  or  two  additional 
tests  and  repeat  the  process  until  he  has  identified 
the  unknown.    A  typical  student-computer  dialogue  is 
as  follows. 

.CRT:  Do  you  have  the  handout  sheet  on  which  the 

codes  for  the  experiments  are  listed?  If 
i  not,  get  one  from  the  assistant  before  you^ 
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C 


CRT: 


CRT:* 

Student 
CRT: 
CRT: 
student 
CRT: 
CRT: 
Student 
CRT: 
CRT: 


continue.  You  will  also  need  the  Shriner, 
Fuson^  and  Curtln  textbook. 

When  you  have  a  copy  of  the  mimeographed 
sheet  with  the' codes  of  the  experiments  on 
it,  and  the  , textbook,  then  press  the  space- 
bar to  continue. 

The  unknown  Is  a  solid.     Which  te^t  or 
experiment  do  you  wish  to  make  now?  Type 
In  one  of  the  codes  below. 

Ig  (Igpltlon  test)** 

burns  with  a  smoky  flame. 

The  unknown,  etc. 

mp  ^(meltlng  point) 

31-3^^C.  ^ 


The  unknown,  etc . 
sf  (sodium  fusion) 
All  tests  negative^ 
The  unknown,  etc. 
Student^  ;sl  (sGluLlllty) 

CRT:  ^  ^K^O,  no;.HCl,  no;  NaOH,  no;  HjSO^ ,  yes 
CRO;:  ^The  unknown,  etc. 

Student:     off(slgned  of  f -the  CR^:^  temporarily) 


This  statement  Is  repeated  after  each  test  result 
is  displayed.     It  will  be  abbreviated,  "The  un- 
known t^tc  . " 

For  th,6  reader's  convenience,  the  intei^pi^etations 
of  the  codes  are  included  in  parentheses. 
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The  CRT  displays'" an  INDEX  when  the  student  returns 
and  the  student  uses  the  light  pen  to  return  to  his 


unknown t 

• 

CRT: 

The  iinknovm,  etc  . 

Student : 

0-9  (2,^-dinitrophenylhydraz)lne) 

CRT: 

deep  ired-purple  ppt 

CRT: 

The  unknown,  etc. 

Student : 

0-29  .(Tollen's  test) 

CRT: 

silver  mirror  deposited 

CRT: 

The  unknown/  etc . 

Student : 

otr 

The  CRT  displays  an  INDBX  when  the  student  returns 
and. the  student  uses  the  light  pen  to  return  to  his 
unknown . 


CRT:  The  unknown,  etc. 

Student:  d-l6  (semicarbazone) 

CRT:  white  solid,  mp  225-2270C. 

-CRT:  The  unknown,  etc. 

Student:  d-l8  (2,4-DNP) 

CRT:  reddish-violet  solid,  mp  greater  phan  250°C 

QRT:^  The  unknown,  etc. 

Student:  "bp  (boiling  point) 

CRT:  ■  approximately  250Oc. 

CRT:  The  unknown,  etc. 

Student:  d-il9  (oxime) 

CRT:  white  solid,  mp  108-109°C. 


c  9-90 

r 


CRT:  The  unknown,  et^  . 

Student:     22-37-3  (piperonal) 

CRT:  Excellent,    You  hav.  arrived  at  the  correct 

answer.  Next  you  Will  be-  shown  the  INDEX  ■ 
of 'unknowns  available.  Use  your  light 'pen 
■to  choose  one. 

Although  no  formal  evaluationJof  this  program 
has  been  attempted,  it  appears  that  the  students 
work  much  more  efficiently  on  'their  laboratory  un- 
knowns after  exposure  to  our. CAT  organic  qualitative 
analysis  scheme.    Pdr  ex^ple^ while  many  students 
were  not  able  .to  finish  five  unknowns  per  semester 
prior  to  CAT,  they  are  now  able  to  splve,  six  unknowns 
with  no  loss  in  accuracy  of  identification.    We  are 
pursuing  modifications  designed  to  make  the  Simula-', 
tion  even  more  realistic. 

Acknowledgements :We  wish  to  thank  the  staffs  of  the 
Instructional  Systems-  Institute  and  the  Computer 
Center  of  Texas  Christian  University,  especially  Mrs. 
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tance. 


REFERENCES, 

(1)  S.G.  Smith,  J.  Chem.  Ed.,  kl ^  608(1970). 

(2)  W.^Gasser  and  J.L.  Emmons,  J.  Chem.  Ed.,  ^^7. 

137(1970),         '  -  —  r-  ; 

(3)  *J.A.  Swets  and  W.  Reurze.ig,  Science,  I50,  572' 

(1965).  •  •   


9-91 


Chemistry  and  CAI  at  the  University  of  Texas 
at  Austin:  A  Coordinated  Approach 


G.  H*  Culp 
The  University  of  Texas  at  Austin 

Introduction 


Utilization  of  computer  techniques  in  undergraduate 
chemical  education  has  been  an  on-going  process  at  the  Univer- 
sity of  Texas  at  Austin  since  1965.    Thelfirst  carefully  controlled 
educational  experiments  involving  the  use  of  (3AI  as  a  supplement 
to  genera*!  and  undergraduate  organic  chemistry  were  conducted 
there  during  1968-70.        The  results  of  these  studies  indicated 
a  consistent,  highly  positive  effect  on  student  performance  and 
attitude  when  computer-based  techniques  were  applied,  'these 
studies  have  also  provided  an  Excellent  foundation  upon  which  to 
base  current  efforts  toward  the  actual  implementation  of  supple- 
mental  CAI  programs  in  undergraduate  chemical  education. 

The  test>rfg~and  evaluation  of  the  original  programs  was 
conducted  through  the  coope.ration  of  the  University  of  Texas 
Computer-Assiflted  Instr,uction  Laboratory  and  utilized  the  IBM 
1500/180(f  computer  system.    As  these  studies  progressed, 


9-92 

y 

several  limitations  with  the  available  facilities  became  apparent. 
First,  by  definition,  the  CAI  laboratory  is  cieciicated  to  re  ^eajrcK^ 
Often  several  concurrent  projects  in  various  educational  disci- 
plines are  in  development  within  the  laboratory,  thus  severe 
limitations  are 'placed  upon  the  number  of  stud^rxt  participants  in 
a  given  project.    Likewise,  the  ease  of  access  to  the  progaratrxfl 
is  confined  hy  the  large  number  of  projects  axid  the  Limiteci  faoiU 
ities,--'  Secondly,  the  Coursewriter  II  language  v^as  found  t:o  be 
inappropriate  for  several  types  of  potential  C^I  applicatioxis  tbat 
requi.re  "direct  computational  capability,    And^  in  addition^  the 
programs  could  not  be  teleprocessed  to  remote  terminals  due 
to  hardware  video  data  transmission  limitations^ 

It  was  evident  that  both  a  more  versatile  language  a 
more  flexible  system  would  be  necessary  to  sufficiently  r  ealisse 
the  CAI  potential  applied  to  undergraduate  ch^niistry  cour  ^es^ 

The  Development  of  CLIC 

As  an  outgrowth  of  the  documented  eff ectivenftss  of  Ca  I 
and  the  need  of  a.  language  that  would  provide  the  capabilit-y  fox: 
CAI  utilization  on  a  broad  basis  within  the  University  of  T^xa^ 
system,  computation  Center  personnel  developed  the  langvx-age^ 
CLIC   (Conversational  Language  for  Instruction  and  Conmp vating)* 
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The  CLIC  language  bears  a  number  of  similarities  to  the 
well-known  Cour8ewi:iter-type  languages  in  its  use  of  ^  very  sim- 
ple statement  format  and  question-answer  logic.    The  language  is 
actually  an  extension  of  the  PICLS  language  developed  at  Purdue^ 
University.    The  basic  CLIC  language  is  very  easy  to  learn  and 
can  be  readily  mastered  with  only  a  few  hours  instruction*  Thus 
the  beginning  user  quickly  reaches  the  point  where  he  is  writing 
simple  instructional  programs* 

For  the  experienced  user?  however,  CLIC  differs  substan- 
tially from  both  PICLS  and  COURSEWRITER*   CLiC  is  designed 
as  an  extension  of  the  gen4ral  purpose  programming  language, 
FORTRAN*    Thus  the  CLIC  user  h^s  access  to. all  of  FORTRAN, 
including  the  fuUxomputational  capabilities,  data  structures^ 

library  subprograms^  and  input-output  system;    FORTRAN  state- 

» 

ments  may  be  intersper&ed  with  CLIC  statemeAts  in  CLIC  pi'o- 
grams,  CLIC  programs  may  call  FORTRAN  subprograms,  and 
FORTRAN  programs  may  call  CLIC  programs.    For  the  experi- 
enced user  this  access  to  FORTRAN  allows  him  to  make  optimal 

*- 

J 

use  of  the  special  features  of  CLIC,  such  as  the  question-answer 

♦ 

logic,  and  hp  may  still  revert  to  FORTRAN  when  the  CLIC  langua 
proves  too. limited. 

The  CLIC  processing  system  also  differ^  substantially 
from  most  orther  CAI  systems.   CLIC  programs  are  translated^  ^ 
into  Fortran  by  a  special  preprocessor,  after  which  the  standard 
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Fortran  compiler  prpcesses  them  to  produce  highly  optimized 
machine  code*   A  CLIC  program  may  be  saved  in  this  compiled 
form  and  then  executed  whenever  a  student  wishes  to  interact 
with  the  program.   Execution  of  a  CLIC  program  is  extremely 
efficient  both  because  of  the  compiled  form  of  tlie  program  and 
b^cAuse  all  data,  including  text,  is  stored  iti  core  8torage*a\ong 
'  with  the  program  so  that  the  only  input-output  involved  in  exe- 
cution is  the  input -output  to  the  student's  ter»minaU 

Another  advantage  is  clear:  a  partial  solution  to  the 

wlclely  recognized  problefn  of  inter-institutional^yBtem  copipati- 

j 

bility  is  at  hand*   If  the  institution  has  a  FORTRAN  compiler j 
transference  and  exchange  of  CLIC  programs  may  be  accomplighed* 

/  V 

Thus  CAI  capability  may  readily  become  available  to  a  majority 

of  the  various  computer  systems. 

** 

The  University  of  Texas  Chemistry  Project 

V 

/  ,  •    ■  ~  . 

In  the  past  year,  translation,  revision,  and  expansion  of 

the  original  CAI  programs  into  CLIC  has  been  accomplished. 

Utilization  of  the  programs  will  commence  in  the  fall,  1971. 

Currently/^^the  chemistry  CAI  facilities  include  access 

to  a  proposed  16 -terminal  "laboratory"  to  be  used  jointly  with 

the  physics,  zoology,  and  computer  science  departments..  In 
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addition,  4  terminals  are  available  solely  for  the  chem/stry 
project^participants.    The  terminals  are  Model  ASR-33  tele- 
types; access  to  the  computer  is  over  standard  telephone 
lines.    The  terminals  will  possess  rahdom  access  slide  capa- 
bility for  the  display  of  information,  spectra,  lenghthy  textual 
material,,  etc#  / 

■  / 

Following  the  original  theme  and  strategies  employed 
in  the  initial  studies,  the  programs  are  supplen^ental  to  and 
coordinated  with  the  conventional  course  mat^iaU  Described 
broadly,  the  programs  may  be  defined  as  either  experiment 
simulation,  remedial  drills  tutorial  dialogue,  or  evaluative 
interactions.    Individualization  is  stressed /through  random 
function  generation  of  numerical  vcilues,,  question  se^uemce  (in 

the  drill  programs),-  and  extensive  branching  capability.  In 

'      ^  ...  * 

addition,  most  programs  allow  the  student  to  request  ai-d  or 

additional  inforrpation,  skip,  commertt,  dr  stop  during  a  given 

interaction/  *  ^  . 

More  specifically,  the  General  Chemistry  project  will 

involve  approximately  200  students  in  a  total  multi-media 

approach  to  teaching  that  will  dUow  student  access  to  film  loops, 

audio -tutorial  tapes,  and  the  CAI  programs  of: 
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A,  Tutorial/Drill  • 

1.  Scientific\Notation 

2.  The  Metric  System 

3.  Temperature  Conversions 

4.  Formula  Writing'  ^ 

\j  5.  Inorganic  Nomenclature. 

*  •  /  / 

6,  Percent  Composition 

7.  Atomic  Structure 

8*  Balancing  Equations 
9.  The  Mole  Coricept, 
10*  yiie  Gas  Laws 
11,  Meet,  Work, and  Energy 


IZ^'^Energy  Changes  in  Chemical  Reactions 
"13 J  Stoichi^ometry 

14,  Concentration  Units  -  ^  ^ 

15*  Ionic  Equilibria 


B,  Exjjeriment  Simulation 

1.  Millikan  Oil  Drpp  Experiment 

2.  J.J.  Thompson  Discharge  Tube  Experiment 

3.  Geiger-Marsden  Experiment  ' 

4.  Mosely  X-Ray  Experiment  ' 

5.  Emission  Spectroscopy 

6.  CoUigative  Properties 


ERJC. 
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1.  Acid-Base  TitMtion 

8»  Calorimetry 

9»  Iron  Ore  Analysis 
10  •  Kinetics 

lU  General  Quantitative  Unknown 
\Z.  Molar  Volume  "of 
13*  Qualitative  Analysis,  and  ^ 
14*  Synthesis  of  cU-tran»_  Isomers 


AppTokimately  150  students  will  participate  in  the  Organic 
Chemistry  project.   Although  the  project  does  not  currently  make 
use  of  film  loop  or  audio -tutorial  teaching  aids,  fairly- extensive 
supplementation  of  the  course  is  provided  via  the  CAI  programs  of: 
A:  Tutorial/Drill 

U  Nomenclature  of  Alkanes 
Z.  Nomenclature  6f  Alkenes 
*  ^  3;  Preparati9i\s  and  Reactions  ^of  Alkanes 

4i  Nomenclature  of-fhe  Derivatives  of  Benzene 

5.  Preparations  and  Reactions  of  the  Derivatives  of  Benzene 

6.  Organic  Syntheses  I  ' 

7.  Organic  Syntheses  U  ^  . 
8    Nomenclature  of  Alcohols,  Aldehydes  and  Ketones 

.9,  Preparations  and  Reactions  of  Alcohols 


9-98 


10.  Mechanisms: 

a.  Halogenation  of  Alkanes 

b.  Addition  of  HBr  to  Alkenes 

c.  Dehydration  of  Alcohols 

d.  Bromination  of  Toluene 

e.  Bromination  of  Benzaldehyde 
^     U,  Elementary  NMR  Interpretations  , 

Experiment  Simulation 

1.  The  Synthesis  of  Iv-Methylcyclohexene  and 
3-Methylcyclohexene  Irom  2-Methylcyclohexanol 

2.  Kinetic  Studies  I 

3.  Kinetic  Studies  II 

4.  Organic  Qualitative  Analysis 

C.  Evaluative  / 

1.  Quiz:  General  Organic  Nomenclature 

2.  Quiz:  General  Organic  Reactions 

3.  Quiz:  Basic  Reaction  Mechanism 

It  is  hoped  that  the  expanded  version  of  the  chemistry 
project  described  ab.ove  ^ill  experience  the  success  exhibited 
by  the  ea.rlier  pilot  studies. 


<2l  ^rW 
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MODERN  METHODS  OF  HANDLING  CHEMICAL  INFORMATION 
by  Martha  Williams 


Chemical 'information  and  data  are  the  products  of  testing^,  re-^ 
^    3eai:cti,  'and  development^  and  they  become  a  resource  for  ntfw,  o'^it  * 
further  research  and  development.    Most  chemists  have  ah  VppWcia- 
t-idn  for,  or  ajsociate  a  value  with,  information  and  ^data.  '  th'^-areJ^J 
,    '     also:becoming  increasingly  aware  of  the  fact  that  infor?n*ationJ.feoiia/ * 
can  be  generated,  processed,  stored,  managed,  retrieved;(.^.;a^d:Used 
'  '    i^n  new  ways.    This  is' a  result  of  the  increase  in  use;  and'methods;,"v 
of  economically  usin^,  computers,  in  chemistry.    Computers  are.msed*. 
as  tools^in  research.  ,  They  are  used  for  control  and  mqijiitDriiig  d'f 
^    ^  experiments,  and  for  Simulating  experiments^-whether.  ^i^i  be  for.''./  ' 
analysis,  kinetics,  or  synthesis  of  compounds."   Oacfe/the  research*  / 
or  testing  is  completed,  the  products  of  the  work  i^e^/the  datVy  ; 
.    ,  can  be  selected,  reduced  from  analog  to  digital  form,'  tabplated,  r 
graphed,  charted,  printed  or  otherwise  put  iifi.use^^le  fopm., '''fh^se 
computer,- generated  data--in  either  human-readable  or  maphiWr:   '  * 
readable  form,  on  paper,  magnetic  tape,  disks,  .drums,  chips  pi^..  '> 
microform- -now  become  a  part  of  a  vast,  va'^ed,  and  disp^yVe  data 
reservior.  ^  •  ^ 

'  Infdrmation  and  data'can'be.  considered  to  be  Of '  sevei*i9.1  ^types  ^ 

(1).  digitized  numeric,  'as  in  the  case'  of  chemical  h.an'^bdok^S*;  (2).     ^  *■ 

..analog,  as  in  the  case  of  IR  or  other  spectral  data,  (3.3 /symbolic,'  \^ 
as  in  the  case  of  structures  of  chemical  dpmpounds,'  and"^4)  ^ipha- 

"betic  as  in  the  cas'e  of  information  expressed       naturgfX  language/,,  * 
These  d^ta,  if  they  are  fo  be  used  again  as  'an.  input  to*'furthet,.     •  ^ 
research),  must  be  stored  on  an  appropriate  medium,  arranged  :iti^a 
file  stru(iture  suitable  for  searching,  and  maT)agedN  intelligently .  .  ' 
The  existence  of  sucH  a  .data  base  enables  Us  ta  conduct  se'arehes  ^ 
and  retrieve  infonjiation  that  Enables  the  researcher  .to  ol?taij\/ '  . 
backgroui>d' information,  Keep  up-to-date  in  his  fieW,  supplement     '  ^  [ 
thft  inform^ion  he  ha?^  identify  persons,  orga^izations, -documents, 
or  other  sources  of  information,  or  to  make  j\idgments  a:^;  to  whether  • 
^the  reseaj^.ch  he  propbses -has' already  been  .done.  ^  ^r/  ' 

^  Machine  readable  data  b&ses  ji'ave  been  develop*ed  as  direct.  Z^'* 

'  products  .of  ;research  and  as  secondai^y  or  tertiary  products.  .  that  ^. 
is,  research  may  be  directed  toward  generation  of  data-rlR-speet^ral 
for  example--or^  the  resyits^  of  research  may  be  written  tip  in  ^  ' 
.    journal  articles,  that  are' then  abstracted  and  indexed  in  secqrifdary 
sources.    Th^  journal  information  or  thfe  abstract/index  inlormation 
may  be  put  in  machine-readable;  form  either  for  purposes,  6f  sear-Ching 
or  for  purposes  of  automating  production  and  printing  functioijs. 

^jii^Prior  to'-tfte  adv^ht  of  economic  computer  composition  the  cost  of 
keypunching  data  to  create  large  files  f^r  cqmj^uter  searching- was 
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very  high  and  could  seldom  \)e  justified  on  the  hasis  of  information 
^retrieval  alone. 

,  .Novf,  that  large  data  bases  are  often  created  as  a  byproduct 
of  publication,  the  cost  factor  for  input  is  no  longer  so  signifi- 
cant.   In  addition  to  private-files,  many  nev  data  bases  have 
become  publicly  available  from  commercia:i  organizations,  government 
agencies  and  scientific  and. technical  societies.    It'behoaves  the 
chemist  to  become  aware  of  these  ne^w  .sources  of  chemical  informa-  ' 
tion  and  to  learn  how  to  use  them.    He  may  use  them  within  his  own  ' 
organization  or  he  may  purchase  th^  service  from  inform^ition  brokers 
in  information  tenters^       .      •  . 

During  today's  session  we  wili  hear  reports  of  current  activities 
in  searching  and  retrieving^  chemical  information  and  data.  The 
papers  and  demonstrations  will  run  the  gamut  from  handling  of" 
analytical;  data,  data  for  organic  synthesis,  syinboUc^data  of  organic 
structures,  through  natural  language  .information  in- primary  and 
secondary  sources.     '  ,  *  .  . 
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Conference  on  Computers  in  Chemical  Education  and>Research 
■    V  ~     July  23,  1971 

^  Northern  Illinois  University,  DeKalb,  Illinois 

.  .  •  '^OERN  METHODS  FOR  SEARCHING 

THE  CHEMICAL  LITERATURE 

Bart  E.  Holm 
Secretary's  Department 
E .  I .  du  Pont  de  |^Jemours  &  Co . ,  Inc  . 

INTRODUCTJON 

Beilstein  once  said,  "I  regd  everything.  I  place  if  where  i.t  belongs." 
You  can'f  do  this  any  longer  except  in  narr6w  fields.  You  are  blessed  or 
plagiJed  witbvOn  abundance  of  published  qnd  unpublished  information  (Fig.  1) . 
This  is  an  historical  continuation  of  a  growing  Wealth  of  knowledge  waiting  to  be 
put  ^o  use  (Ffg.  2).  Chemists  ore  fortunate,  for  of  all  the  disciplines  chemistry 
is  unique  with  a  tradition  of -information  excellence  and  a  tradition  of  information 
organization.  Although  you  have  growing  information  resources,  you  do  have 
new  .and  improving  mechanisms  for  managing  these  resources.   I  will  state  the 
types  of  services  a  jchemisj"  requires  and  introduce  the  modern  methods  and  formats 
used  in  these  services.  You  will  see  the  critical  role  played  by  computers.  The 
papers  and  the  demonstrations  to  follow  will  show,  in  detail,  how  these  methods 
are  being  widely  used  to  help  the  chemist*  Many  of  these  you  can  apply  on  your 
campus,  mony  are  inexpensive,  all  are  useful  .  We  hope  you  will  see  ways  to 
advance  the  educational  process  through  th^se  methods  and  services. 

CATEGORIES  OF  NEEDS  AND  INFORMATION 

The  informatfon  needs  of  chemists  can  be  stated  as  the  need  for  current 
awareness,  selective  dissemination,  and  retrospective  search  services,  of  research, 
development,  engineering,  production,  and  marketing  information  located 
Internally  or  externally,  and  contained  in  journals,  patents,  theses,  reports,  data 
files,  information  services,  and  from  people.  Current  dworeness  is  continuous 
professional  education  through  reading  journols,  attending  meetings,  and  informal 
communications  with  colleagues.   Selective  dissemination  is  receiving  information 
related  to  your  interests  from  the  remaining  published  literature  beyond  that 
scanned.   Retrospective  search  Is  looking  fOr  information  to  answer  a  spe^:ific  need. 
Chemists  have  traditional  and  new  servicesjn  each  of  these  cqtegories. 

Chemical  information  can  be  categorized  as  dealing  with  stj^iictures,  data, 
and  concepts .  ,  -  ^ 
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Structures 

Most  of  the  effort  m  the  information  processing  field  has  been  in 
chemistry  with  much  of  that  ieffort  on  improved  wa^s  of  manipulating  chemical 
structures.  Chemists  use  chemical  structures  to  communicate  information  about 
a  chemical  compound.  Chemists  need  access  to  structural  information,  and  the 

structure  is  a  discreet  entity  amenable  to  organization. 

'  ---^ 

Chemists  commonly  represent  chemical  compounds  with  pictures,  an 
excellent  method  for  one  chemist  to  communicate  with  another  using  a  blackboard. 
Each  understands  the  meaning  of  the  message.   Unfortunately,  the  diagrams  are  ^ 
not  pronounceable,  the  diagrams  cannot  be  ordered,  and  the  multidimensional 
representations  are  difficult  for  printers  (Fig.  3)4 

A  number  of  nomenclature  systems  are  in  existence,  each  assigns  one 
correct  name  for  each  compound.   Nomenclature  serves  weW  for  oral  communication 
and  for  printing  but  nomenclature  is  not  very  effective  for  claivsearching  because 
the  main  structural  part  carries  many  different  names,  depending  bpon'^iie  structure 
of  the  rest  of  the  compound .  ^ 

Beilstein  Cfassification  is  a  common  way  of  identifying  chemicals  but  it  is 
difficult     search  for  subcode  information  unless  you  know  the  major  classification. 

Pictures,  nomenclature,  and  classification  have  a  place,  but  each  has 
drawbacks  for  organizing  large  collections  for  indexing  and  searching.  The  major 
emphasis  in  recent  years  has  been  on  fragmentation,  notation,  and  topological 
coding  (Fig  .  4) .  ^ 

In  fragmentdtion  a  compound  is  represented  as  a  composite  of  its  major 
structural  features.  Usually  the  fragments  or  functional  groups  are  assigned 
numerical  codes.  Many  schemes  were  developed  in  the  late  1940 *s  because  the 
material  could  be  coded  on  punched  cards  for  later  search.  Most  files  have  been 
converted  to  computer  systems.   Fragmentation  is  a  useful  technique  but  it  does 
not  define  the  structure  completely  enough  for  some  searches.  With  notation  the 
structure  is  represented  by  a  single  line  of  symbols.  The  next  paper  describes 
chemical  notation  systems.  In  topological  coding  the  structure  is  represented  by 
a  unique  array  of  symbols  in  a  compact  format  for  storage  and  search .  The  paper  - 
On  the  Chemical  Abstracts  Servire  includes  mention  of  their  registry  file,  chemical 
structures  represented  by  topological  -coding .  Some  of  the  scheduled  demonstrations 
show  how  these  methods  have  been  applied. 
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•These  three  approaches  are  major  advances  enabling  chemists  to 
manipulate  large  collections  of  chemical  structures  to  locate  specffic  structures 
and  to  bring  together  groups  of  structures.  The  Committee  on  Chemical  Information 
in  1969  published  "Chemical  Structure  Information  Handling  -  A  Review  of  the 
Literature  1962-1968/'  publication  1733,  National  Academy  of  Sciences, 
Washington,  1969.  '  . 

Data  ,  .  '  ,1 

Data  are  numeric  or  quantitative  notations  helping  to  describe  a  subject  ^ 
more  precisely  •  Chemists  are  interested  io^melting  points,  boiling  points,  molecular 
weights,  and  specific  gravity.  These  identifiers  are  easy  to  list  and  manipulate 
manually  or  by  machine.  One  area,  analytical  data  often  irucludes  instrumented 
techniques  yielding  indefinite  curves  or  figures  for  compounds  and  mixtures.  For 
identification  these  must  be  compared  with  the  curves  or  figures  of  pure  standards. 
Searching  data  by  ccfmputer  is  included  in  another  paper  and  in  the  demonstrations. 

Concepts  ^ 

Concepts  cover  qualitative  information  in  chemistry  as  contrasted  with  data 
and  structures.  Concepts  interact  and  overlap  and  are  not  mutually  exclusive. 
Ablatiorj,  polymerization,  extrusion,  and  spinning  are  all  concepts  and  part  of  the  . 
chemical  literature.  Concepts  have  their  own  forms  of  presentatior)  and  manipulation. 
Here  are  some  of  the  new  approaches . 

ACCESS  TO  QUALITATIVE  INFORMATION 
Current  Awareness 

Chemists  read  journals  to  learn  the  details  of  new  developments  and  to  keep 
current  in  their  field.  Most  professionals  do  not  try  to. scan  all  the  journals  that 
might  have  articles  of  interest.  Those  that  try  to  scan, many  jpurnals  usually  end  up 
with  tables  In  their  office,  den,  or  bedroom  stacked  with  unread  magazines. 
Subscribe  to  and  regularly  read  a  few  core  journ,als.  Use  a  current  awareness  service 
to  alert  you  to  other  material .  Of  the  new  ce.-vices  available  twb  are  representative. 
One  called  CURRENT  CONTENTS^'Physical  i&^hemical  Sciences,  gives  title  information  by 
reproducing  the  tables  of  contents  of  hundreds  of  foreign  and  domestic  research 
journals.   The  weekly  publication  is  extremely  fast.   The  usefulness  of  the  idea  has 
caused  many  organizations  to  create  their  own  table  of  contents  publication  to 
cover  their  journals  In  their  area  of  interest. 
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The  second  service,  CFigmical  Titles^  listing  tiMes  of  articles,  is  issued  ' 
biweekly  and  covers.700  iourrials*  .Titf.es  are  rotated  or  permuted  by  computer' to 
lijt  the  significant  words  'in  alphdbetjcal  "order  (Fig .  5) .  Since  the  work  can  be 
done  by  computer  it  offers  efficient,  inexpeosfve  presentation*  The  title  oppeors 
qs  many  times  as  there  are  significant  words  in  the  titl6»  A  list  of  common  ^ords 
called  a  stop  list  is  used  to  prevent  alphabetizing  on  articles  and  other  common 
words-  These  permuted  listings  are  called  KWIG  (Key.word -in -Context)-  Some 
KWIC  indexes  drop  the  title  information  that  canhot  be  incliided  within  a  set 
number  of  characters  for  the  line-  An  advance  was  made  by  taking  the  keyword 
out  of  the  title  and  listing  it  on  the  left'side  of  the  page  followed  by  the  full  title-- 
These  systems  are  called  KWOC  (Keyword -out -of-Context) . 

The  next  level  of  jcompl^eness  for  current  awareness  beyond  scanning 
titles  woulcf  be  to  receive  abstracts-  The  total  abstracts  publications,  Ghemjpal 
Abstracts,  Biological  Abstracts,  and  Physics  Abstracts  are  seldom  used  for  current 
^awareness  toda'yj  because  of  size-  The  approach  by  these  organizations,  is  to  produce 
subsets  of  the  tc(ta|  in  narrow  fields-  Chemical  Abstracts  is  published  in^  sections - 
CASifclso  offers  Bb^ic  Journd  Abstracts,  Chemical -Biofogical  Attivities,  and  Polymer 
Science  and  TechnoTbgy-  Index  Chemicus,  published  weekly  by  the  Institute  for 
Scientific  Information  emphasizes  synthesis,  isolation,  identification  and/or  biological 
activity  of  new  chemical  compounds-  Other  abstract  services  cover  potents-  These 
are  all  examples  of  published  current  awareness  services-  ^ 

Selective  Dissemination  of  Informotion 

Keeping  vp  with  the  literature  by  current  awareness  techniques  is  often 
too  big  a  task  for  an  individual  -  Selective  dissemination  of  information,  usually 
called  SDI,  is  the  process  of  sending  specific  information  too  person-  Good  librarians 
have  done  this  for  years-  As  they  review  the  material  coming  into  their  collection, 
they  send  items  to  those  known  to  be  interested-   People  fortunate  enough  to  have 
exj^utive  staff  assistants  have  the  same  job  performed  for  them  -   Now  some  services 
are  available  to  help  you  and  me. 

^Each  person  participating  in  an  SDI  service  prepares  a  list  of  terms ,^  cal led 
a  profile,  describing  his  area.  As  material  comes  In,  the  profiles  are  matched  against 
the  indexing  terms,  the  titles,  the  cruthors,  the  references,  or  the  abstracts  to  look 
for  matching  items-  The  person  receives  notification  of  newly  received  literature 
that  matches  his  requirements. 
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In  the  most  commoh  form  the  notification  consists  of  two  punched  cards. 
The  first  card  contains  a  title  and  sometimes  an  abstract.  The  secondxard  is  a 
form  to  request  the  full  article,  and  also  gives  a  way  to  comment  on  hoyv  close 
the  item  matches  the  request.   Systems  adjust  the  profiles  based  on'this  feedback. 
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 Spme  publishers  offer  SDI  services  to  Individuals  or  will  provide  magnetic 

apes  for  processing  by  your  organization  for  local  SDI  services.  CAS  offers^ 
magnetic;  tgpes  for  Chemical  Titles,  Chemical -Biological  Activities,  CA -Condensates, 
and  Pofymer  Science  and  Technology.   Engineering  Index  offers  a  similar  data  base 
prepared  from  3500  engineering  publications.   Excerpta  Medico  Foundqtlon  provides 
magnetic  tope  files  covering  3000  biomedical  journals.  ASCA,  Automatic  Subject 
Citation  Aiei  \ ,  from  the  Institute  for  Scientific  Information  processes  profiles  o^ainst 
the  input  to  the  Science  Citation  Index.    The  user  supplies  a  list  of  auth|Ors  or  cited 
references  and  receives  as  a  product  all  newly  published  work  citing  his  stated 
references.   In  essence  ASCA,  or  the  Science  Citation  Inde^  brings  the  researcher 
forward  in  time  by  listing  those  recently  published  documents  which  cite  one  of 
interest  to  him.   In  addition  to  authors  and  reference,  profiles  can  consist  of 
combinations  of  words,  word  stems,  word  phrases,  organizations,  and  journals.  One 
of  the  papers  that  follow^  describes  how  centers  process  data  bases  on  magnetic*  tape 
to  provide  tl3.ese  services  to  you. 

Current  awareness  and  SDI  services  offer  practical,  economicol,  methods 
^io  assure  that  you  are  alerted  to  newly  published  work  from  all  over  the  worl^d.  Read 
just  a  few  core  journals.  Use  current  awareness  and  SDl  for  the  rest. 

Retrospective  Search 

You  have  head  how  to  keep  current  with  th^  published- literature .  Some 
items  you  will  read  and  discard,  some  you  will  keep.  As  soon  as  anyone  accumulates 
a  few  thou'aand  of  anything,  except  money,  he  may  have  a  problem  finding  what  he 
needs  when  it  Is  required .   Here  again,  there  are  simple  ways  to  organize  your 
personal  collections  to  find  any  item  eas+ly  (Fig.  6). 

First  consider  how  you  will  search  your  files.   Deci(Je  whether  you  will  use 
single  entry  or  multiple  entry  files.   The  admissions  office  keeps  Its  records  by  the 
names  of  students .  A  similar  system  can  be  used  In  chemistry  for  a  file  on  physical 
properties  of  chemical  compounds  If  the  only  questions  asked  are  to  supply  property 
values  of  specific  compounds.   In  these  cases  pick  your  file  subjects  and  set  up 
the  famillpr  classification  system. 
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In  science,  frec^uenfly  this  does  not  occur.  The  need  might  be  fo  locate 
all  chemical  comf^und^  with. a  porficular  property  value.  A  multiple  entry  system 
can  handle  multiple  suipject  searches. 

Traditional  systems  describe  an  item  by  a  term  or  two.  Index  more  deeply  . 
Describe  the  articles  in  your  collection  by  five  to  ten  subject  terms.  Fufly  describe- 
each  item  in  order  to  pibvide  flexibility  in  finding  what  you  want  quickly  and 
economically. 

The  terms  you  se^ect  must  be  organized  for  consistent  input  and  search. 
A  modest  amount  of  vocabulary  control  can  assure  that»information  is  not  lost 
because  you  indexed  it  one  way  and  searched  for  it  in  another.  Be  consistent. 
Use  CA  nomenclature.  The  Condensed  Chemical  Dictionary  is  another  good 
reference.  If  your  subject  area  is  extensive  and  complex  conjsider  formalizing 
your  vocabulary  by  recording  synonyms,  generic  levels,  and  related  terms  (Fig.  7). 

A  variety  of  storage  forms  are  available  for  the  index.  First,  give  each 


article  or  item  in  your  collection  a  sequential  file\number  and  store  the  items  in 
order.  Then  set  up  a  record  for  your  index.  Here  is^ne  form  (Fig.  8) .  A  card 
is  used  for  each  term.  Record  the  item  number  on  thejerm  card  that  applies.  In 
searching  select  the  terms  of  interest  and  look  for  matching  item  numbers.  In 
the  example  report  100  discusses  "ablation  of  plastics  in  heat  shields  of  space 
vehicles". 

The  Committee  on  Chemical  Information  has  a  file  of  over  300  articles 
on  recent  significant  developments  in  chemical  information  p^-ocessing.  Abstracts  ' 
for  each  are  kept  in  serial  order.  The  storage  form  for  our  index  is  a  computer 
produced  optical  coincidence  deck.  Each  dard  is  a  term.  Item  numbers  are 
recorded  as  holes  in  the  card  and  answers  to  questions  pre  found  by  overlaying 
term  cards . 

Computers  are  used  to  produce  indexes  for  manual  searching  as  well  as 
files  for  machine  searching/  Any  of  these  methods  can  assure  quick,  complete 
retrievaJ  from  your  files.' 

Microforms  .  • 

Files  can  become  voluminous.  Collections  of  journaU,  abstrdcts 
publications,  and  patents  can  grow  larger  thdn  the  space  available.  Microforms 
have  come  into  active  use  to  reduce  storage  requirements,  provide  rapid  access, 
^  and  are  an  Inexpensive  way  to  provide  multiple  copies  of  whole  documents.  The 
common  microforms  are  microfiche,  aperture  cards,  apd  roll  film  in  cartridges  (Fig  .9). 

Er|c  •    ■  b..   GiJ.  . 
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A  microfiche  is  sheet  mJcrofilm  about  4x6  Inches  commonly  containing 
5  rows  of'12  images  and  a  header  strip  for  dbcumenf  identification.  Government 
reports  are  available  as  microfiche  for  less  cost  than  full  size  hardcopy. 

Aperture  cards  are  tobuloting  cards  with  a  hole  in  which  the  frim  is  placed. 
Normally  onfe  frame  is  included  in  the  card  although  some  systems  hojd  up  to  eight 
frames.  The  Department  of  Defense  stimulated  the  use  with  the  ruling  that  all 
engineering  drawings  submitted  for-  their  contract  work  must  be  on  aperture  cards 
meeting  their  standards.   U.Si  patents  are  to  be  available  on  aperture  cards. 

The  most  familiar  and  economical  microform  is  16mm  roll  film.   For  liigh 
usage  the  film  is  stored  in  cartridges  which  simpli  fits  threading  in  reading  equipment. 
A  100  foot  reel  of  16mm  film  usually  contains  2500-3000  8-1/2  x  11  Inch  poges  of 
material .   The  American  Chemical  Society  offers  many  of  their  publications  in  this 
forry  including  ACS  journals  and  Chemical  Abstracts. 

The  thick  listings  you  formerly  received  from  the  computer  cs 'printout  can 
now  be  returned  to  you  on  16mm  film  or  microfiche  by  using  COM  (Computer  Output 
Microfilm)  equipment. 

Chemical  Information  Resources  ^  ' 

Chemical  information  is  widely  available  in  the  United  States.  The 
American  Chemical  Society  is  activ6  in  providing  primary  and  secondary  services 
both  as  a  wholesaler  and  a  retaiJer.  Of  the  commercial  services  in  chemistry,  the 
Institute  for  Scientific  Information  is  the  most  extensive.  A  number  of  their  products 
have  been  described  and  you  will  see  Qthers  during  the  demonstration. 

The  Biosciences  Information  Service  publishes  a  variety  of  products  in 
their  field  including  Biological  Abstracts,  and  a  KWIC  index  called  Bi  ological 
*  Abstracts  Subjects  in  Context. 

The  National  Library  of  Medicine  in  Washington  is  a  reservoir  of  published 
literature  in  the -medical  sciences.   Each  month  an  abstract  journal,  Index  Medjcus,  is 
published.  All  incoming  journals  are  indexed  for  storage  and  retrieval  for  their 
computer  system  called  MEDLARS . 

The  United  States  Government  has  many  other  information  services  of  value 
to  chemists.  A  notable  resource  is  U.S.  patents.  The  U.S.  patent  office  has  issued 
nearly^. 6  million  potents,  a  rich  source  for  chemists.  The  patent  office  has 
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concentrated  in  recent  yeas  in  improving  their  method  for  supplying  copies  . 
of  patents.both  as  hardcopy  ond  in  microforms  They  have  done  little  to  improve 
the  intellectual  access  to  the  content  of  patents.  Commercial  services  fill 
this  gap  by  offering  abstracts  and  indexes  to  U.S.  and  Foreign  patents.  In  the 
ijnited  States,  Information  for  Industry  publishes  the  Uniterm  Index  to  Chemical 
Parents  and  has  magnetic  tape  services.  In^London,  Derwent  Publications 
inaugurated  a  Chemical  Patents  Index  in  January,  1970  providing  abstracts, 
microfilm,  and  evenhi;ally  indexes  to  worldwide  chemical  patents. 

Summary  \ 

This  has  been  an  overview  of  new  approaches  to  the  processing  of 
chemical  information.  Remember  that  a  chemist  needs  a  balance  of  current 
awareness,  selective  dissemination,  and  retrospective  services .  New  techniques 
for  handling  structures,  concepts,  and  data  are  available  to  help  him  in  his  work. 
New  publications,  formats,  and  services  permit  him  to  choose  the  inputs  to  his 
problem  solving  procedures.   The  computer  is  playing  a  key  role  in  many  of 
these  services.  These  are  all  tools  to  help  the  chemist  do  his  work  more  effectivel 
(Fig.  10) .  The  methods  are  available,  many  are  inexpensive.  They  are  widely 
,used  in  industry  and  in  the  government.  Can  these  methods  help  your  faculty  and 
your  students,  now  and  in  the  future?  We  dre  convinced  that  they  can. 


RtAP  AU  VOU  COT  NOW  ^  ** 


FIGURE  1 


1  ^- 1 


ABSTKACT  mUlB 


FIGURE  2 


CHEMICAL  STRUCTURES 


CHEMICAL  STRUCTURES 


fictuft 


CM/ 


[  F  F 

y  -  0   jC  C-H 
F  F 


Pt>«««to*«,  Q,o.,h^,  T«tr«fluofO  •  m-M«thyl 
B«i(tttin  Clmi;»cttK>n 


FIGURE  3' 


Eth*r 

fhtortm 

••AMnt  Rtn«  2t2 
1-1  C«Aflfurtti«n 

N*Ut>pn 
FVFXFFOR  C 


-O^C  CM 
F  F 


C 
C 
O 
F 
F 
F 
F 
C 
C 

c 
c 
c 
c 
c 


FjGURE  4 


L 
1 

1213L 


4 


>iMi««o;cA^«qic  tcio  a^o 

M|NO  *01*iC   KIO  ANO  A. 
tlACTIOll  Of   OfMA-  ANO 
,  WlTH^f  sriMl^lCillON  Of 
Ci^iOIC  ACIO  •fTilNf  I 
MdVlilOjS  MiC.N{rt*NfA« 

OK  Of  M»^l«NfTTIO  CUIN 
Of  TNf  MKIMOCM.  «ATl 
f«SON'.»  ^(«)«UIAriO«i 
t^OftS  SUillMATI* 
Mf  NiNtU-SAl^f  Tf  t«»f  THf 
1  TAV-O^fMroA  IMfOJtAl 
H(  NfUf*ON  SiOKlNG  OOMN 
H>NO«*(HOlOOICAL  HAIritt^L 
^A(T  ^A««i«t^T(«  tNO  WAVE 
*  lMft«0Cr»N4HlC  OlffoSlOH 
*  *         tiNET  fC 

Mt«T«M  'rOC«-*0COi  vulOV 
SOLUTION  Of  ThC  H4&*0Kt 
l,C4l   SoiO^IONS   TO  ILOC»t 

TICN  Of  Tm{  <«AMOrtTIVT 
f  0«*«f  VUUS  tV 


f^SdON-AMlxO  C*^«OiC  ACIO  AnO  E^Sll  CIlMAl-COJt-t)^! 

(^>I10I*-CA^«0  LACTAM  IN  LACTAM  OILS.  C  I  NMA|-00»  I- 1  S^l 

f^SlLON-OI   AMINO  ^IMIilC  ACIO  IN  ^H^L A 1 -0022- 1 >1* 

f^SlLON-NALO  KITONCS  hlTN  (WT*!**  J*CSAT>00«2'0>«» 

f^SILON-f»<-»INl  OXr  CAKlONVL-L  LVSINl  C JChAG-OO^I-O)!^ 
f^SILQiHtf  T«l  MtTNVL  L  rSIN(  (.••MMONI '^O^T  A(-00l)'-02*t 

fOUHITr  Of    TnE   tL(CT«ON  ANQ  NUCLION  NCI  A  AT -00*»->0«2  I 

(OUATlOH  AtOuT  A  CVllNORICAL  rAHTKi  J(»SA*-«0»2«»t») 

fOUATlON  ANO  MONTI  CA«LO  KISUiTS  JC  ^S  A*-«0^^- 101) 

(OUATION  fO«   THE    SEOUCNTlAL  KANOOM  ACSAA«.002>')7«) 

COUATION  Of    STATC  Of   lAKKCK  ANO  H(NO  JC^SA*-«0»l-»2«^ 

(4UATI0N  Of    THE    STA7(  Cf^ONATOMIC   V  T f N| A9-002«- 1  7 ) t 

COUATION.-                     •SCATTERING         T  ^T^i(AV-O0^3^O093 

EQUATION.*           *fQ*  THE   AfAMANGEMfN  JC  ^  A*-«0»'l-»20« 

COUATIOM.*  TRANSIENT  SOLUTIONS  Of  T  NS(NAO>00>«-0273 
tOJAT10N.*«   ANO  llfETlMES  FROM  THt   ^  JC^SA*-00»2'0«2* 

EQUATIONS  fO«  Oi«(CT  (XCITa7I(M  ANO  J^t\tl-O003-O02 t 

EQUATIONS  fOR  Mol  7 1  COMrftNe  NT  AllOvS.  «T&f  tf-000l-0M» 

EQUATIONS  fOR  SURfACt  AOSOR^TION.-  litMAA^OOOVOlO) 

EQUATIONS   TO  A  MICROSCOPIC   TH(ORY  I E  PY  AA,>02  >  1*0010 

(OUATIONS.'           •CONOUCTOKS.     (XACT  l( fV A A'02 32-001 1 

EQUATIONS.*     •S^MtCONOUCTORS.     NUMfR  ISTKAI-OOlO-IOlO 

C\}V4T|0NS.  *V>  fKA£TIONALtY  tHARCEO  CJPHAD^0«I--027« 

EdWLiiRiuN  ANO  Quantitative  analysI  chooat-027o-oo25 

EQUlLllRIUM  CENTftIf UCtTION  IN  CISIUM  ^HYTAJ^^OIt^ 


FIGURE  5 


NOW  LtTs  see- — . 
FIGURE  6 


Coricept  Coordinatian 


(tffCt 

A  mutn 


uin 

FIGURE  7 


PEPORT  luO 

ABLATfON 
PLASTICS 
HEAT  SHIELDS 
SPACE  VEHlCifS 


ABLAT|C^* 


PLASTICS 
M£AT  SMlUw^ 
SPACt  VtHlCUS 


FIGURE  8 


ERIC 


1 


CHEMICAL  NOTATION 
A  LINEAR  DESCRIPTION  OF  CHEMICAL  STRUCTURE 


by 


Carlos  M.  Bowman 
The  Dow  GJiemical  Company 
Midland,  Michigan 


10-15 


ABSTRACT' 

Linear -notation  systems  provide  a  machine-compatible 
description  of  chemical  structures.     The  most  widely  used 
is  the  Viswesser  Line  Notation  (wLN) •     This  report  presents 
a  brief  summary  of  ±he  WLN  system,   its  background-,  appli- 
cations,  and  significance  in"^  registry,   searching,   and  cor- 
relative research.  •         '  '  i  ' 
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For  centuries  chemists  have  found  it  necessary.' to 
Hlevis^  systems  to  represent,   in  both  written  and  oral 
form,    the  nature  of  chemical   substances.     First  attempts 
•were  rather  unsystematic  and  led   to  the  development  of 
trivial  names.     It  is  interesting  to  no te , '^however ,  that 
as   the  deficiencies  of  trivial  names  become  apparent , 
chemists   started   to  use  drawings   to  convey  to  other 
chemists  the  exact  nature  of  particular  chemical  com- 
pounds.    Thus,   chemical  structures  or  structural  diagrams 
came  into  use.     Over  the  years  attempts  have  been  made  to 
systematize  names  and  a  large  body  of  rulfes  which  govern 
the  nomenclature  of  chemical  compounds  has  gradually 
evolved*     Nevertheless,   chemical   structures  have  con- 
tinued to  be  a  very  important  medium  of  communication 
between  chemists.  J 

\fe   are   talking  primarily  about  methods  by  which 
chemical  characteristics  suchj  as   structure  are  repre- 
sented on  paper  ^nd  orally.     ^s  compounds  become  more 
complicated,    systematic  noraen<*latur e  becomes  very  burden- 
some and  chemists  resort  again  to  trivial  njaraes.  However, 
when  they  want   to  express  the  actual   structure,  uniquely 
and  unambiguously,    they  use  structural  diagrams. 

In  order   to  facilitate  locating  information  about 
chemical  compouliids  and*  classes  of  chemical  compounds , 
chemists  have   organized  nam^  and  structures  into  ordered 
lis t-s-^vrfTicI^  we  commonly  call  indexes .     In  doing  so,  they 
have  accentuated  a  particular  comp9nent   of  the  molecule, 
thus  giving  it  prominence  in  their  indexes.  'However, 
chemists  have  always  wan^ted  to  be.  able  to  associate  com- 
pounds with  similar  stru^'cture  characteristics  -which  are 
not  necessarily  those  noirmally  highlighted  or  accentuated 
in  nomenclature  and,  in  <)rdinar^  indexing.. 

The  advent  of  pun6h  cards  and  later  of  computer 
technology  prompted   the  development  of  coding  schemes 
which  permitted  the  ordering  of  names  or  structural 
diagrams  or  identification   tags  of  compounds  from  a 
number  of  possible  entry  points .      Perhaps   the  most  widely 
used   schemes  are   the   so-called  fragmentation  codes .  In 
using  fragmentation  codes  a  series  of  structures  i«  broken 
up  into  ^gnificant  components  and  codes'  are  assigned  .to 
each  one'^'f  these  components  qr  fragments .     These  codes 
ar*e  numbered   or  assigned  particular*  punchers  on  either 
regular  computer   type  punch  cards  dr  edge  notched  cards . 
I^any  such  systems  havo  been  developed,    each  one  usually 
designod   to  meet    the   specific  need^,  of  the  individuals 
using-  a  particulnr  collection  of  chemical  compounds. 
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To  retrieve  informktion  from  one  of  these  files,  one 
merely  has   to  select   the  fragments  of  interest,  obtain 
the  fragmentation  codes  assigned  tOv those  fragments,  and 
then,    through   the  use   of  a  needle  in  the  case  of  edge 
notched  cards  or  a  .sorting  or  collating  machine  in  the 
case  of  punched  cards,   go  through  the  entire  file  and 
select   th.ose  cards  which -have  been  coded  with  the  desired 
fragments.     This  card  retrieval  usually  completes  the 
search ,    for  each  card  c  on tains  a  reproduc  ti  on  of   the  name , 
the  structure   of  the  compound,   perhaps   some  of  its  basic 
properties,    and  usually  a  reference  number  that  leads 
the  user   to  files  where  more  detailed  information  is 
avail#bl e .  v     , , 

As   the   size  of  these  coded  files  grew,    i  t  b>ecame 
impera  tiv.e  to  devise  faster  and  more  def  ini  tive  ways  of 
retrieving  information.      With  the  advent   of  computers 
many  of  these   systems  were   expanded  aiad   translated  to 
computer  searching  so   that  sjearches  could  be  done  faster, 
and  could  accommodate  more  complex  combinations  of  frag- 
ments.    However ,    as'  we  moved   to  computer  searches  of 
files  which  had  been  indexed   through  fragmentation  code? 
we  reduced   the  ability   to  retrieve  immediately   the  names, 
the   structur^es,    and   other  information  about  compounds. 
All   that  we  normally  retrieve  fro^  a  computer  fragmenta-- 
tion   type   search' i's*..  a  listing  of  numbers  which  then 
necessitates  reference,   usually  manual,    to  a  separate 
file   to  6btadi;n  •associated  information.     Another  real 
problem  with  f ragipenta tion  codes  is  the  fact  that  for 
a  given  compound   there, is  not  a  unique   set  Of  'codes,  and 
from  a  particular^  set   Of  codes  it  is  not  usually  possible 
to  reconstruct  in  an  unambiguous  manner  the   structure  which 
g-enerated   that   set   of  codes.    'Mos  t"  fragment  a  ti  on  codes 
tell  us  what  particular  chemical   fragments  are  present 
in   the   compound,   but  do  not   tell  us  much  about  how  they*^ 
are.  associated  in  ar  molecule.     Thus,    for  large  files  the 
number  of  extraneous  material  which  is  retrieved  is  very 
sig^iificant  and  troublesome. 

I   think  it  is  apparent   then  that  chemists  felt  a  ^ 
need  for  the  development   of  "a  code   system  which  would 
permi  t   the  use  of  mephanic  al  device  s  for   s  tor age  and 
retrieval,   whj-ch  would  permit  the  representation  of  a 
chemic  al   s  true  tur  e  in  a  unique  manner ,    and  \vhich  would 
allow   the  unambiguous' reconstructio?i  of  the  chemical 
structiare  from  the  code  representation.      Jt\  addition*, 
the  code   shouTd  be  precise  and  easy  to  use.      If  I  wore 
to  list  all    the  properties  of  such  a   system,  you  would 
very  likely   say   that  chemical  nomenclature  would  mo(*t 
most   of  these  requiren\ont  s ,    and  .that  i§   true.  **Howovor, 
nomenclature  is  riot-easy   to  use;    that  it  is  difficult 
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to  arrivfe  at  the  proper  name  and  once  gne  has  ?  nkme  it 
is  rather  difficult  to  manipulate  it  using  computer's  and 
other  such  mechanical  devices.     Thus,    chemical  notations 
came  into  being.     Many  notation  systems  have  been  developed 
over  the  years^     G.   Malcolm *Dyson  developed  a  notation 
which  parallels  chemical  nomenclature v5) .   Herman  Skolnik 
developed  a  system  which  is  used  at  HerculesUO).  other 
notation   schemes  include  Hayward ' s v 7 j ^   which  hasxb 
studied  at^the  U.   S.   Patent  Office,   and  Silk'sv5)i 


een 
in  Europe. 


It  h^s  been  characteristic  of  these  systems   tjiat  they 
draw  a  fair  amount  of  attention,    and   then  fade  away  .because 
of  a  lack  of  use.     An  exception  to  this  has  been  the 
Wiswesser  notation   system,   developed  by  William  Viswesser, 
which  was  first  reported  in  1954U6j.     j  believe  that 
this   scheme  of  identifying  chemical   compounds  would  more 
than  likely  have  ^one   the  way  of  all   thQ  other  notations 
had  not  a  group  of  interested  users  decided  to  experiment 
with  Wiswesser 's  ideas  and  concepts  and  make  a  concerted 
effort   to  prove  and  develop   the   system.     Thus,    over  the 
period  of  the  next  10-12  years,   a  nvunber  of  chemists  who 
were  respo;nsible  for  the  retrieval   of  information  from 
moderately  large  and  complicated  chemical   files   tried   to  ' 
use   the  notation   scheme  developed  by  Viswesser.  This 
resulted  in  a  rather  extensive  revision  which  improved, 
clarified  and  extended   the  basic  principles  of  the  system. 

Concurrently  with  the  work  of  the  information  oriented 
people,    a  group  of  individuals  interested  in  handling 
chemical   structures  via  computers  became  interested  in 
the  notation  and   tried   to  write  computer  programs  which 
would  permit   the  detailed  manipulation  of  codes  derived 
from  notations  using  the  existing  notation  rules.  It 
became  apparent  as  the  very  systematic   and  rigid  discipline 
of  computer  programming  was  applied   to   these  rules   that  a^ 
number  of  inconsistencies   and  deficiencies  existed.  Con- 
sequently a  cooperative  effort  among  the  original  inventor 
of  the  notation,    some  of  those  people  interested  in  ^ 
informatioi?  retrieval,    and   the  computer  people,    led  to 
the  development  of  a  manual   of  rules,   published  in  1968 
as   the  Wiswesser   Line-Formula  Chemical  Notation(^2) . 
We  have   since  adopted   the  name  Wiswesser  Line  Notation 
and.  use   the  acroj^m  VLN.      I  will  use   that  for  the  remainder 
of  my :  talk  . 

^  I  would  like   to  emphasize   that   the' present   set  of 

WLN  rules   are  rules  which  have  been   tested  and  which  have 
survived   a  fairly  long  period   of  use.     They  are  not  the 
rules  conceived  by  one  individual,   but  rather  th^y  repre- 
sent  the  agreement  of  many  people  who  have  used   the  system 
in       real   environment.     Unlike  many  of  the   other  notation 
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systems  which  I  have  mentioned  very  briefly,  earlier, 
th^  WLN  has  become  widely  used  by  many  industrial  con- 
cerns.    In  addition,    one   of  the  major  information 
publishers,    the  Institute  for  Scientific  Information, 
Inc.,   has  launched  a  ser-C-ice,    the  Index  Chemicus  Registry 
Sy s tem»   based  on   the  VLN  sy s t em • 

I  would  be   the  f  arv  s'r^To'  add ,   however,    that   the  system 
is  certainly  not  complete.     It   still  has  room  for  improve- 
ment and  further  development.     Hopefully,   my  brief  descrip- 
tion today  of  the   system  and  what  can  be  done  with  it 
might  interest  some  of  our  friends  in  the  academic  wo^ld  - 
to  further  test,   develop  and  expand   the  appl icabili ty| of 
the   system.  ^ 

The  basic  philosophy  of  the  notation  system  is  based  " 
oh  the  use  of  a  limited  set  of  symbols   to  describe  rather 
elementary  components  of  a  chemical   structure.     Listed  on 
Fiffure  1   are   some  of  the  VLN  symbol s  and   the  chemical 
characteristics  which  they  represent.     Because  of  time, 
I  cannot  really  describe   to  you  in  great  depth  how  the 
notation  works,     I  can  only  give  you  some  very  simple 
examples  of  how  it  relates   to  chemical   structures.*  The 
context-rich  system  uses  only   the  26  alphabetic  characters, 
10  Arabic  digits  and   several   special,  symbol^  which  are 
commonly  used,    such  as   the  asterisk,    the  dash,    the  amperscind 
and   the   slash.     The  size  of  this  limited  vocabulary  is 
essentially  doubled  by  use  of  the  blank  space;  symbols 
acquire  a  different  meaning  if  preceded  ^by  a  space.  By 
the  judicious  assignment  of  these  symbols   to  particular 
chemical  characteristics,   and  by  application  of  WLN  ritles, 
we  are  able   to  represent  uniquely  and  unambiguously  the 
majority  of  organic  compounds  using  the  W.iswesser  Line 
Notation  system. 

On   the  next   slides   (Figures. 2  -  6)   I  would  like  to 
show  you  some  examples  of  chemical   structures  and  tl\e 
Wiswesser  notations  which  have  been  derived   for  them. 
You  will  note  that   there  is  a  one   to  one  correspondence 
between  the   structural  fragment  and   the  notation  symbol. 
In  addition,    the  notation  is  a  complete  description  of 
the  compound  inasmuch  as  connec  ti  ons  be  tween  the  various 
elements  and  groupings  are  clearly  indicated.      Since  the 
notation  is  limited   to  commonly  occurring  symbols,    it  is 
easily  adapted   to  computer  manipulation  and   thus  we  have 
a  simple  way  of  representing  in  computer  readable  form 
the   total   structural  information  about  a  given  compound 
or  compounds . 

There  are  many  uses  which  have  been  made  of  the 
notation  using  computer  systems.      Perhaps   the  first 
application  of  computers   to   the  handling  of   lists  of 
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Wiswesser  notations  was  the  simple  listing  of  notations 
in  alphabetical   order  by  Howard  Bonnettv^^fT     The  nature 
of  the  rules  and   symbol  assignment  is  such  that  an  alpha- 
betized listing  tends   to  group ' together   similar  compounds. 
Thus   some  generic   searching  i^  possible  from   such  lists. 

A  very  logical   extension  to   this  was   the  application 
of  Key-Word-In-Context   techniques  to  notations   to  produce 
a  permuted  index(l3).     it  is   Ihls   type  of  index  which  the 
Index  Chemicus  Registry  System  produces  on  a. monthly  basis 
for  new  compounds  rep or  ted  in  the  li  terature     ) ^  These 
two  applications   employ   the  computer   to  reorganize  the 
notations  and   to  produce  a   tool   for  manual  retrieval. 

Many  other  uses  have  been  made  of  notations  in  the 
last  five  years  and   some  very   sophisticated  searching 
techniques  are  rjow  avai]able-   including  some  on-line 
conversational   systems  ^    '^/^one  of  which  you  will  see 
demonstrated  later  thi,s  afternoon^ 

I  have   summarized   on   the  next   slide   (Figure  7)  some 
of  the  uses  of  WLN  computer-based  systems. 

One   of  the  more  common  uses*  of  such  a  file  is  to 
ascertain   the  prior  existence  of  a  compound  in  a  particular 
file.*    This  can  be  accomplished  with  a  relatively  high 
degree  of  accuracy  with  notations;   however,    if   one  is'^" 
primarily  interested  in  registration  and  identification 
of  imique  compounds  in  a  file,    there  are  other  methods, 
such  as   the  Chemical  Abstracts  Connection  Table  system, 
which  are   superior  for  this  particular  purpose.  Hyde 
and  his  co-*workers  at  Im^perial  Chemicals  Inc  orporaJ:ed , 
Ltd.   have   successfully  demonstrated   the  genera  ti  oii^  of 
structure  displays   from   the  notation  ;    thus,  an 

output  from  a   search  need  not  require  additional  lookup 
steps   to  find   the  name   or  require   the  scientist   to  learn 
the  notation.      Instead  a  picture  can  be  presented   to  him 
directly. 

The  greatest  use,    of  course,   with  notations  has  been 
for   searching^     Two  general   approaches  have  been  used  in 
this  area.      One  has  been  direct   searching  of  the  notation 
as  reported  by  Monsentol^S)   and   others.     Usually  in  such 
a   scheme  a  conven ti onal   text  searching  approach  is  used , 
in  which  a   certain  character  or  ga:^oup  of  characters  is 
identified  and   then  looked   for  in  the  notation.  This 
type   of  approach  has  not  been  too  widely  used  because  it 
is   oxpeAsivo  and   time  consuming  from  a  computer  standpoint. 
A  mor^  commonly  used  appron^^h  is   to  design  a  computer 
program  which  {generates   fragments   such  as   those  discussed- 
(V-^rlier  m   this  paper,    and   then  use   the  fragments  to 
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narrow  douTi   the   size   of   the   search  file^    *      .      If  neces- 
«^;ir\  ,        suits   of  this   coarse   screening  can  be  further 

(  lied  l)>    us('   of   some  kind   of  an  automated  on-lijie 
st^archin^^  or  by   visually   scanning  lists  of  notations 
proiiuccd  b>    the   fragment   search*      It   is  possible  to 
further  extend   the   searching  to  an  atom-by-atom  type 
network  ninteh  which  would   then  produce  complete  specific 
re  ti'ioN  n  1  •      Th^   economic  s   of   thi^   last'  approach  have  not 
been  nttrjjctive   tc^-dato,    and  for   that   reason  we  have  only 
soe^n   some  initial    trials   to  establish   the   feasibility  of 
such  a  concept. 

Most  notations  can  be  decoded   easily  by   a  chemist 
with  a  brii^f   training  in   the  notation  rules.      Our  exper- 
ience has  been   that   after  one  afternoon   of   training  and 
a   little   practice,   most   organic   chemists  can  read  and 
interpret  most  notations.      Coding  is  much  more  difficult 
if  one   is   interested   in   the  unique  notation  which  conforms 
to   the  IVLN  rules.      Training  and  extensive  practice  are 
necessary.      However,    the  VLN  rules  ar^  much  easier  to 
apply,  than  nomenclature  rules   and  coding  costs   are  equiva- 
lent   to  connection   table   input  costs.      Algorithmic  checking 
of  rules, of  procedure   can  be  applied   to  detect   erroi^s  in 
encoding  V /  ;   however,    the  design  of   such  programs  is 
difficult  and  costly. 

The  economics   of  notation  systems,    however,    do  make 
them  attractive   for   small    to  moderate   size   files.  The 
ability   to  produce   intelligible   ordered   lists,    such  as 
permuted  indexes,    permits  extensive   searching  of  signi- 
ficantly  sized   files.      The  generation  of  fragments  has 
been   another  common  use   of  notations.      The •c ompl exi ty 
of   the  fragmentation   schemes  has  varied   from  simple 
symbol   fragments'  '^»-^^)    to  very  extensive   codes  which 
can  only  be   searched  by   computer  sy s tems ( ^ > ^ ) . 

In   summary,    the  development  of  a  chemical  notation 
system  for   the  linear  representation  of  chemical  structures 
has   enabled  us    to   store,    in  computer  compatible   form,  com- 
plete descriptions   of  chemical    structures.      This  capa- 
bility has   led   to   the  generation  of  ordered   lists  and 
other  manual    searching  tools   for  use  in  a  conventional 
manner.      It  has   also  promoted    the  development   and  expan- 
sion of  fragmentation-based   systems  which  contain  much 
more  detailed   and  more  consistent  coding,    thus  enhancing 
a   searching   tool   which  has  been  in  existence   for  several 
decades,    and   which  has  proven  its  worth   time   and  time 
again.      Finally,    the  notation  system  allows  us   to  ^o  to 
more  detailed   type   searching.      Since   it   is   a  complete 
description  of   the   structure   of  a  molecule,    it  also 
permits   us    to  ron\(»Tt    from   on(>   s\stem    to   another.      For*  J 
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example,  with  the  proper  algorithms  we  can  go  from  a 
notation  system  to  a  connection  table,  which  in  turn 
can  lead  us  to  a  systematic  name. 

4  » 

The  availability  of  a  complete,   concisQ  and  precise 
descripti on ^of  a  chemical  molecule  in  computer  manipu- 
latable  form  opens  up  many  aVenues  of  work,   including*  the 
potential  for  obtaining  better  understanding  of  the  rela- 
tionships between  structure  and  properties  of  compounds. 
It  is  our  hope  that   this  brief  description  of  a  notation- 
system  and  its  possibilities  will   interest  you  suf f icien;tly 
to  learn  more  about  it  and  to  entice  you  into  areas  in 
which  you  can  apply  your  interest  in  computer-based  systems 
to  studies  of  the  manipulation  of  chemical  strugtjares  and 
of  structure/property  relationships.  ' 
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Figxire  1, 


WISWESSER  LINE  NOTATION 
SELECTED  SYMBOLS 


B 

Boron 

.  E  . 

Bromine 

C 

Garb on 

G 

Chlorine 

F 

Flxiorine 

M 

Imino  (-NH-) 

N 

Nitrogen 

"  Q 

-  Hydroxyl  (-0H) 
Benzene 

0 

Oxygen 

R 

P 

Phosphorous 

U 

Unsaturation  ( =) 

S 

Sulfur 

V 

Carboxyl  (^C=0) 

-NA- 

Sodium 

w 

Di oxygen  (O::  ,Jd) 

-ZN- 

Zinc 

z 

Amino  (-NHg) 

.  K 

1 

-  N  - 

» 

X 
Y 

1 

-  9  - 

-  c  -  , 

1 

-  C  = 

H 


space&-^/012  .  .  .9ABC.  .  .2^ 

( ear lies  t  latest) 

Hierarchical  Symbol  Rank 


10-2Z 


Figure  2 . 
WISWESSER  LINE  NOTATION 


Example 


CH^CH^CH^OCH^COOH 


0  1  V  Q 


QV103 


SP  Structure  Formula 

GP  Graphic  Formuaa 

Wljsr  Wlswesser  Line  Notation 


Example  2, 
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Figure  k . 
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Figure  5. 
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Figure  6. 
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I  vill  approach  my  topic  from  two  points  of  view;  first, 
the  use  at  Chemical  Abstracts  Service  of  mocfern  methods  of  handling 
chemical  ijiformation  and,  second,,  some  outputs  of  CAS  that  require 
modem  methods  for  their  exploitation*    Both,  points  will  be  illus- 
trated by  reference  to^the  CAS  ChemicaJ,  Registry  and  the  Integrated 
Subject  File*  ^ 

Modern  methods  must  be  defined  at  more  than  one  level*  Moat 
apparent  are  the  new  devices  and  'techniques  ^that  one  can  use  tb  seek 
information  from  services  that*ha>f??  been  produced  by  means  of  othet 
new  techniques  and  devices.    Of  equeil  or  greater  significance, 
although  perhaps  less  obvious,  is  the  revolutionary  change  that* 
modern  methods  can  and  should  effect  in  the  relation  between  4in 
information  user  and  the  body  of  recorded  information. 

It  is  pertinent  -for  information  users  to  have  some  compre- 
hension of  the  evolving  CAS  computer-based  proce'^ssing  system  because 
that  system  is  radiceilly  different  from  traditional  publishing 
procedures  and  technolofcy.    The  computer-based  system  is  far  more 
flexible  and,  therefore,  far  more  amenable  to  the  changing  demands 
of  the  user  community  .than  classical  'hand  operated"  publishing 
could  possibly  be.    In  addition,  modern  technology  maJces  possible 
th-e  production  of^  information  files  of  radically  new  kinds. 


*  Summary  of  a  presentation  at  The  Conference  on  Computers  in  Edu- 
cation and  Research,  iiorthern  Illinois  University,  DeKalb,  111., 
July  23,  1971- 


^0-33      .  ' 

The  CAS  Chemicqii  Registry  is  the  nojnenclature  control 

system  for  CAS  indexing  qf  chemical  substanc-es.    As  suchj  the 

Registry  is* part  of  the  day-to-day  computer-ba6ed  'proce5sing 

system  that  produces  Chemical  AbstractsV  its  indexes, 'and  various 

specialized  service^-    An  int'eprral  p^t -Of  the  Re^ci'stry  is  a 

continuously  grovin^?  file  of  machine-readai)le  representations  of    *  - 

molecular  structures  which  reflect  all  of  the  structural  detail 

(e.*^.,  stereochemistry.,  labelling:,  etc. )' published  by  tJae^author 

>     '  of 
ip^referring  to  a  substanc'e.    A^socia^^ed  with  each  sub^&ijce. 

recprd  is  a  molecular  formula  and  a  unique,  computer- assigned 
R|gistry  Number  which  is  a  link-  to  the  CA  Subject  Indexes. 

Information  centers  are  learning  tp  manipulate  these  files 
as  search  data  bases  to  seek  whole  structures  or  any  desired  sub- 
set of  atoms  and  bonds  that  may  be  less  than  a  complete  structure. 
< 

The  information-usinfc  community  will  soon  *be  able  to  obtain  search 

service  from  Registry  files  through  information  centers. 

newest  CAS  service  is  the  Integrated  Subject  File,  machine 

readable  countearpart  of  the  volume  and  collective  Gubject,  f'olecular 

Formula 5  and  Ri^gistry  I^umber  Indexes  to  Chemiced  Abstracts,    First  . 

made  available  to  subscribers  in  ISF.  Volume  71  corresponding?  to 

the  Volume  Indexes  to  CA  Volume  71,  the'  700,000  entries  in  ISF 

Vpliine  71  will  occupy  nine  reels  of  tape  in  CAS's  Standard  Drstri- 

« 

but  ion  Format.     ISF  Volumes  72  and  73  are  expected  before  the -end 
of  1971:    all  other  CA  volumes  in  the  eifthth  collective  Period 
(Volumes  6f>-15)  will  have  IGF  counterparts.    The  ISF  is  intended 
to  be  a  continuin/7  service. 


Optimun  utility  of  the  ISF,  probably  the  largest  machiihe- 
readable  infonoation  file  to  be  made  routinely  available,  will 
be  determined  by  the  user  community  itself  which  mu^t  be  depended 
upon  for  development  of  search  stratef^,  search  software,  and  an 
overall  philosophy  of  exploitation  of  this  very  large  and  very 
new  kind  of  data  base.  ^  ^         '  ^ 
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ABSTRACT 


The  Information  Sciences  section  ,of  IIT  Research  Institute 
hps  designed  and  developed  a  Computer  Search  Center  for  processing 
and  searching  varied  machine-readable  data  bases  to  supply  informa- 
tion to  a"^  variety  of  users  in  industiry,  government,  .academic  and 
other  research  institutions.     The  system  was  designed  in  a  user  / 
oriented  fashion  amd  provides  a  full  complem.ent  of  options  to  the 
subscriber.     These  include:    various  data  bases  of  portion  of  drrta 
bases;  profile  features  suCj^as  inclusion  or  exclusion  of  any  kind 


of  data  element  that  appear^  on  a  tape;  full  truhcSation  cauv^bilitics 
on  ^ny  kind  of  term;  full  free-form  Boolean  logic  for.  resulting 
terms  to  one  anqther;  grouping  of '  related  terms;  multiple  sort  options 
and -multiple  print  media  options.     Usqr  aids  Including  Search  Manuals, 
Truncation  Guides,  Frequency  Lists,  KLIC  Indexes  arid  Bicfram  Pre- 
quency  Lists  have  been  prepared  for  each  data  base.     Twelve  r-^asoris 
why  scientists^  should  use  the  GDI  services  of  information  centers 
nre.  discussed  includin'j  cost  benefits  and' comparative  posts  of  d>ata 
base -prep^^^tion,  in-house  3DI,  and  subscrlpt^n 'SDI. 
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DESIGN  FOR  UNPREDICTABLE  FUT'URE 

Information  centers  exist  to  provide  information  from 
machine-readable  cjat^a  bases  to  users  in  industry,  universities 
and  other  organizations.     The <Computer  Search  Center  of  .IIT 
Research  Instrtpt^  was  designed  with  q  number  Qf  variables 
and  uncertainties  lying -before'  us.     We  were  not  able  to  pre-    •  ' 
diet  the.  data 'bases  to  be.  used,,  the  character  sets,  the 
character  codes,  data  base  tape  formatsv  data  Base  record 
formats,  data  element?, . arid -^afa  base  content.     Nor  could  we 
predict  the  hardware  we  might  use,  user  groups,  user  require- 
ments./ vocabulary,  Search  requirements,  output  sorts,  output 
formats,  and  output  media.     These  are^  a  few  of-^the  variables 
we  saw.     Our  design  had  to  accommodate  all  of  them.     We  did  not 
pre-determine  which  data  .bases  and  what  us^r  grx)ups  we  would  . 
provide  services  to.     We  wanted  to  be  in  a  petition  to  haindle 
any^jpffata^lDase  for  which  there  is  a  significant' market .  .  i 

As  we  all  know,  search^  programs  for  handling  machiiie^  ■ 
readable  data  base^s  are  expensive  to  develop  and  expensive 
to  maintain.     We  had  no.  desire  to  incur  the  expense  of  majln-  ^ 
taining  multiple  search  mrograhis.     For  this  reason  we  developed 
a  general  purpose  search^program  that  would  handle  virtually 
any  of  the  machine  readable  data  bases  containing^  natural 
language  information. 

.  When  handling  multiple  data  bases  one  is  very  likely  to 
encounter  multiple  character  sets  and  multiple  character  codes. 
The  tape  formats* and  record  formats  differ  from  data" base  to  . 
data  base.     In  fact,  they  differ  Within  data  bases  that  , are 
produced  by  the  same  organization.       The  data  elements  contained* 
on  the  tapes  vary  considerably  from  tape  to  tape. 

*      "  *  \ 

File  Structure-  ^ 

 7  ^  \   . 

Jn  order'to  handle  multiple  data  bases  we  had  to  determine 
a  file  structujre  that  would  accor^modate  all  of  the  variables'. 
ThetiiTRI  standard  format  does  precisely  that.     WitH  tVii^ 
format  each  record  is  composed  of  a  key,  directory,  and  char-  " 
acter  string.     The  key  contains  the  volume,   issue,  and.  cita-  ' 
tior)  number  as  given  by  the  data  base  supplier?  artd  the  direc- 
tory identifies  each  type  of  element,  contained  in  the  record, 
according  to  IITRI  data  type  codes.'    Some  of  the. data  type 
elements  are  listed." 
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DATA  TYPE 

CODES 

DATA  ELEMENT 

01 

SOURCE  INFORMATION 
--CODEN 

--JOURNAL  REFERENCE 

--PAGINATION 

--DATES 

02 

TITLE  OF  ARTICLE 

03 

AUTHOR (S) 

04 

SHORT'  JOURNAL  TITLE  ' 

05 

KEYWORD (S) 
.  INDEX  TERMS 
CA  SECTION  NUMBER 

06 

) 

< 

CA  REGISTRY  NUMBER 

• 

07 

4 

MOLECULAR  FORMULA 

08 

CORPORATE  AUTHOR  s 

09  ^ 
10 

'    ABSTRACT                                     •  ' 
r  TEXT 

■   BA  CROSS  CODE 

.1« 

if 

< 

-   BA,'BiOSYSTEMATI.C  INDEX'      >  •  . 
„EI  CARD-A-LERT  CODE  ^ 

13  • 

& 

• 

• 

ORIGINAL  LANGUAGE 
AVAILABILITY 
PUBLISHER  / 
,  PRICE 

D^TE  OF  MEETING 

PARENT  JOURNAL               '  ,. 

UKIGINAL  ABSTRACT  SOURCE 

< 

'.figure  1. 

Data  Elements 

4 

and  IITRJ  Data  Type  Codes 
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7416-081368 


(Volume ,  Issue 
and  Abstract  Number) 


Directoirv! 


1 

'    1  . 

26 

*  (CODEN) 

4 

27 

14 

(Journal) 

2 

4l"  ' 

76. 

^   (Author (s) )' 

•  3 

117 

60 

(Title)-  • 

8 

177 

51 

(Corp.  Authbr) 

5 

228 

40 

(Index  Terms) 

13 

266"  * 

17 

(Language) 

string:  ■  ■  .  " 

JPCHAX/75/3/325-30/000071/J.     PHYS.  CHEM. VIBRONIC  EFFECTS  IN 
THE  INFRARED  SPECTRUM  OF  THE  ANION  OF  TETRACYANOETHYLENEDEVLIN, 
J.   PAUL$M00R5:,   JESSE  C.$ SMITH,  ■dONALD$YOUHNE,   Y0UNG$DEP.  CHEM., 
OKLAHOMA  STATE  UNIV.,    STILLWATER,   OKLA. $CA073Q00$  iR  SPECTRA 
ALKALI  METI'AL  SALTSORIG.   LANG.:^'.ENG        /   .  ^  , 


Complete.  Record  Appears  on  Tape  as : , 

7416-081368        1        1  '    '  '26        4  27 

177' 


117 


60- 


8 


51 


14  \  2 
228  40 


41 

13 


76. 
268 


17  JPCHAX/7.5/3/325-30/000071/J.  PHYS.  CHEM;  VI^RONli  EFFECTS 

IN  THE  INFRARED  SPECTRUM  OF  THE  ANION  OF  TETRACYANOETHYLENEDEVLIN, 

J.  PAUL$MOORE,  JESSE  C.$SMITH,   DONALD$YdUHNE:,  .YOUNG$DEP.~^CHEM.  , 

OKLAHOMA  STATE  UNI\^.  .    STILLWATER,   OKLA.  $CA"07 3000$  IR  SPECTRA 
ALKALlV  METAL  SALTSORIG ..  LANG .  :     ENG  ■  ,  :* 


Figure  2.     IlTRI  Formatted  Citation 
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In  the  directory  the  code  is  followed  by  the  starting 
position  for  the  actual  data,  and  an  indication  of  the  number 
of  characters  required  by*  the  data.     Thus,   in  Figure  2,  for 
the  |ecord  having  Citation  Number  81368  of  Volume  74,  Issue  16, 
in  Chemical  Abstracts  Condensates  there  is  a  CODEN  which  starts 
in  position  1  and  is  26  cha-racters  long.     The  next  kind  of 
(Jata  elemeat  included  in  the  record  is  a  Journal  Name  which  is 
a  type  4  code.     The  actual  data  starts  in  position  27  which  is 
one  position  beyond  the  end  of  the  CODEN  data  an\3  is  14 
characters  long.     The  next  data  element  is  the  Author  Name, 
which  is  a  type  2  data  code,   and  starts  in  position  41 
which  is  one  position  beyond  the  end  of  the  journal  data. 
Author  data  is  76  characters  long,   and  on  down  the  line.  If 
you  follow  through  Figure  2,   the  format  becomes  obvious.  The 
string  portion  of  Figure  2  shows  how  the  actual  data  for  this 
particular  reference  is  contained  in  IITRI  format  on  tape,  and 
thfe  complete  record  which  appears  in  the  lower  portion  of 
Figure  2  shows  the  entire  key  directory  ahd  character  string 
for  the  particular  record  as  it  appears  on  tape'. 

The  use  Of  data  element  codes  allows  us  to  handle  multiple 
varied  data  elements.     The  system  also  allows  us. to  add  new 
data  elements  and  new  data  type  codes  as  they  a<rise.     We  have 
no  way  of  knowing  what  new  data  elements  suppliers  may  include 
in  their  tapes  a  few  yearp  from  now.     However,  we  have  allowed 
for  9,999  different,  data  type  codes.     It  is  unlikely  that  we 
will  be  unable  to  accomodate  any  new  data  element  that  should 
come  into  existence. 
#> 

The  standard  IITRI  format  is  employed  for  any  data  base 
processed.     Our  method  for  handling  multiple  data  bases  is  to 
write  a  pre-processor  program  for  each  different  data  base 
that  is  handled  in  the  system.     The  pre-processor  program  re- 
formats the  data  that  is  contained  on  the  supplier  data  basQ 
and  puts  it  into  IITRI  format.     In  that  way  e^^ery  cJata ''base 
looks  the  same  to  the  searc^i  program,  and  all  data  ba^es  can 
be  handled  by  one  and  the  same  search  program. 

Preprocessors  *  *  , 

The  cost  of  writing  pre-rprocessor  programs  is  very  slight 
compared  to  the'  cost  of  maintaining  multiple  search  programs^ 
A  preprocessor  or  ^trfmat  conversion  program  takes'*  about  2  man 
weeks  of  programming,  effort  to  write.     We,  initialled  this  sys-  - 
tem  of  format  conversion  for^a  general i.zed  s^rch  ^ogram  three 
years  ago.     It  has  warkec^  very  well.    'It  has.  allowed  us  to 

,accommodate  chahges  Lr\  data  bases  whenever  the /need  arose. 
Fortunately  or  unfortun^ely  the  dajfer^  base  suppliers  do  mak^ 
frequent  charges  ih  the  format  and" content  g»f' their  data  bases. 
These  changes  may' not  be  largQ  or  terribly  significants  however 

,as  far  as  computer  programs  are  concerned,  any  thahc^e^can'  be 
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disastrouis  if  one  is  not  able  to  accommodate  it, 


use  of  r--i^i:ocessor  together  with  a  flexible  and 
e  file  structure,  have  made  it  possible  to  accommodate 
data  bases  with  varying  contents,  varying  data  ele- 
ffering  character  sets  and  character  codes,  and  dif- 


The 
expandabl 
multiple 
ments,  di 

ferent  tape  formats  and  record  formats. 
Hardware 


The  next  area  of  unpredictability  that  affected  our  design 
is  related' to  hardware.     We,   like  many  organizations,  have 
changed  computers  with  some  regularity.     We  did  not  want  to'  be 
locked  to  any  particular  configuration,  nor  to  develop  our. 
software^  in  an  .assembly  language  that  would  be  very  machine 
and  installation  oriented.     The  system  had  to  be  as  machine  . 
independent  and  installation  independent  as  possible.  For 
this  reason  we  decided  to  adopt  a  higher  level  compiler  lan- 
guage and^  to  use  a  family  of  computers  that  enjoyed  wide 
acceptance  throughout  the  country  and  internationally—lSM  360. 
The  compiler  language  we  adopted  was  PLA  a  language  that  is 
highly  atnenabl'e  to;^handling  natural  language  data.     PL/1  com- 
pilers are  available  on  IBM  3^  series  computers  and  they  have 
been  announced  for  Burroughs  an*^CDC  hardware. 

Software  Modularity 

We  wrote  the  programs  in  a  modular  fashion/in  order  to 
easily  accommodate  changes  to  any  portion  of  the  system.  The 
principal  modules  in  the  systems  include:     (1)  format  conver- 
sion for  reformatting  data  bases;    (2)  a  profile  editing  module, 
that  handles  the  terms,   logic  and  user  related  information 
that  .are  associated  with  each  profile;   (3)  a  search  module 
which  is  the  heart  of  the  system  and  mata(hes  user  profile  terms 
against  terms  on  the  data  base  within  the  requirements  of  the 
profile;    (4)  a  formatting  module  for  preparing* output  in  dis- 
semination format ?    (5)/  a  module  that  sorts  and  prints  the 
output;  and  (6)  a  statistics  gathering  module. 

Machine  and  Installation  Independence 

The  reasons  for  developing  our  software  in  a  machine  inde- 
pendent and  installation  independent  fashion  were  that  we  antici- 
pated our  own  hardware  might  change--which  it  did--,  we  wanted 
to  be'  able  to  install  our  system  at  other  organizations  that 
had  a  need  for  running  SDI  within  their  own  facilities,  and 
we  wanted  to  be  able  to  provide  profile-writing  training 
courses -and  workshops  at  other  locations.     Successful  achieve- 
ment of  machine  and  installation 'independence  with  IBM  equipment' 
is  evident  from  the  fact  that  we  have  run  bur  programs  at 
nin?  different  cpmputef  facilities  with  different:  hardware, 
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computer  models,  versions  of  the  operating  system,  peri?pherals , 
and  releases  of  the  PL/1  compiler.     Figure  3  indicates  the 
varieties  we  encountered  in  the  nine  facilities.     In  no  case 
did  we  have  any  real  difficulty.     Preparation  of  appropriate 
JCL  is  usually  all  that  is  required. 


Hardware :       IBM  360                 "Models:  40 

50 

65 

67 

75 

IBM  37Q                   Model:  155 

Any  computer  with  PL/1  Compiler 

Processors:                              '                         MF/T  . 

MVT 

■HASP 

...  \ 

Operatinq  System  Versions:  15-16 

•     17  ' 

r 

18 

19 

-      .  -     '  19.6 

20- 

et 

PL/1  Compiler  Releases:  4.1 

5 

<r 

5.2 

'                      *•  f 

Figute  3.     Machine  and  Installation  Independence 
of  IITRI  Software  ,  , 
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USER  ORIENTED  DESIGN 

Data  Base  Options 

The  current  awareness  or  SDI  (Selective  Dissemination  of 
Information)  system  has  been  operational  since  the  spring  of  * 
1969.     Data  bases  handled  by  the  Computer  Search  Center  (CSC) 
are  Chemical  Abstracts  Condensates,  Biological  Abstracts,. 
BioResearch  Index,  and  Engineering  Indexes  COMPENDEX.  Two 
more  data  bases  are  planned  for  the  near  future  and  other 
data  bases  will  be  added  depending  on  user  needs*     We  will 
handle  any  data  base  for  which  there  are  enough  subscribers. 
Searches  are  conducted  and  output  sent  to  users  weekly, 
biweekly  or  monthly  in  accordance  with  tbe  frequency  of  the 
particular  data  base  to  be^  searched. 

Prof  ile^-Opt^^ 

One  may  include  searchable  elements  as  positive  or  negati 
search  "terms,   i.e.,  one  may  require  the  presence  or  absence  of 
any  particular  search  terjn  to  qualify  a  citation  as  a  "hit" 
citation.     Among  the  searchable  elements  are  those  shoWn  in 
Figure  1^^    The  search  terms  may  be  single  words,  multi-word 
terms,  phrases  or  portions  of  words,  i.e.,  any  legitimate  char- 
acter string*    ^The  range  of  options  available  to  users  for 
their  profiles  is  shown  in  Figure  4. 

Profile  Features 

'    The  principal  features  built  into  the  system  to  achieve 
effective  profiles  are  the  following:     All  forms  of  term 
truncation;  full  free  form  Boolean  logic  vith  any  degree  of 
nesting;  grouping  or  linking  of  similar  terms;  and  weighting 
of'terms  according  to  user  assignment  of  relevance.  Weighting 
is  used  to  indicate  relative  importance  of  terms,  .separate 
closely  related  concepts  and  to  sort  output  in  relevance  order 

Truncation 

The  terms  themselves  may  be  truncated  to  facilitate 
retrieval  of  terms  containing  common  fragments*    The  four 
truncation  modes  allowed  are  shown  in  Figure  5.  The 
usefulness  of  right  truncation  is  usually  readily  under- 
stood.    Right  truncation  is  used  to  select  singular, 
plural  and  other  forms  of  words  that  contain  a  common  ' 
stem.     The  usefulness  of  left  truncation  is  not  so 
obvious  but  it  can  be  readily  demonstrated.     For  example 
one  might  use  the  left  truncated  tejrm  *MYCIN  to  represent 
antibiotics  ,and  retrieve: 
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Terms  -  anyth'ing  other  than  single  character 

Single  word 
Multi  word  * 
^  ^  Phrase 

Fraction  of  term   ^ 

Symbol  or  acronym 

Kinds  of  Terms  -  anything  on  t,\\e  data'  base 

Author 

Company  Name 

Country    -  ^  '  ^ 

Language 
Date 
^  CODEN 
BA  Cross  Code 
BA  Biosystematic  Index 
Keywords/Index  terms 
CA  section  number 

CA  registry  number  (when  available) 

Molecular  formula   (when  available) 

EI  Card-a-lert  Code 

Abstract  or  Text  term 

Any  term  on  any  data  base  searched 

Sorting  of  Output  ,  Output  Media ' 

Author  5"  X  8"  cards 

Weight  Paper 
Citation  number  Multilith  masters 

Tape 

Microfilm 


Figure  4.     Profile  Options 

Mode 

Function  , 

Example 

none 

requires  exact  match 

term 

of  a  term 

AZO 

left 

allows  substitution  of 

★ 

term 

any  prefix  on  the  term 

DI 

AZO 

right 

allows  substitution  of 

term  * 

any  suffix  on  the  term 

AZO  XY  ^ 

both 

allows  substitution  of 

★ 

term  * 

any  prefix  and/or  suf- 

DI 

AZO  METHANE 

NOTE: 

• 

*  denotes  truncation 

Figure  5.     Truncation  Modes 
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ACTOMYCIN 
ANTIMYCIN 
BIOMYCIN  * 
ERYTHROMYCIN 

NEOMYCIN 
•STAPHYLOMYCIN 
STREPTOMYCIN 

The  one  search  term  *MYCIN  substitutes  for  20  to  30 
specific  terms  which  are  not  shown.     The  use  of  * 
simultaneous  left  and  right  truncation  would  pick  up 
all  of  the  above  terms  plus  plural  forms  e.g., 
'STREPTOMYCINS',   and  those  with  terminal  punctuation 
e.g.  ,  "  ACTINMYCIN.". 


Logic 

Profile  terms  are  related  to  one  another  by  means  o 
logic  symbols.     The  logic  symbols  used  for  the  logical 
operators  AND,   OR,   and  NOT  are: 


Logical  Operators  Symbol 

AND  & 
OR  I 

NOT 


The  logic  expressions -for  profiles  can  be  as  specific  an 
involved  as  is  necessary  to  express  the  user's  question. 
While  most  expressions  are  relatively  simple,' any  ex- 
pression can  be  handled  by  the  system.     F&r  example,  the 
fallowing  expression  would  be^ legitimate : 

(((A&B)  I  (C|D|E|F))   &-JG)  (  ((H&I)  SHJ) 

Linking  or  Grouping  of  Terms 

In  order  to  simplify  the  writing  of  a  profile,  simil 
terms  may  be  linked  together  by  a  link  code  and  then  the 
group  will  be  referred  to  by  the  single  letter  code.  An 
example  is  given  in  Figure  6,  where  a  user  has  requested 
information  on  reactions  of  halogens  and  alkali  metals. 

User  Aids 

In  addition  to  providing  many  options  for  preparing  pro- 
files we  have  developed  a  number  of  aids  to  help  users  write 
profiles,   select  words,   truncate  words,   and  word  fractions. 
Index  Terms  and  Hit  Terms  are  printed  on  each  output  card  to 
provide  the  user  With  information  for  revising  his  profile. 


IIT    RESEARCH  INSTITUTE 


1  0-^7 


Terms 

Halogen 

Halide 

Fluorine 

Chlorine 

Bromine 

Iodine 


Lj-nk  Code 

A 
A 

A  ' 
■A 
A 
A 


AND 


Terms 

"Xlkali  metals 
Lithium 

Sodium 
Cesium 
Potassium 
Rubidium 


0 


Link  Code 

B 

'B 

B 
B 
B 
B 


Rather  than: 

(Halogen  |  Halide  I  Fluorine  |  Chlorine  |  Bromine  I  Iodine) 
&    (Alkali  metals  |  Lithium  |  Sodium  |  Cesium  |  Potassium 
rubidium) 


User  specifies ; 


(A&B) 


Figure  6  -  Linking  or  Grouping  of  Terms  in  a  Profile 


Search  Manual 

The  CSC  Search  Manual  explains  the  basic  techniques 
of  profile  writing,   including:     question  formulation,  con- 
cept identification,  concept  expansion,  profile  refine- 
ment and  profile  modification.     Methods  for  using  search 
terms,  truncation,   logic,   links  and  weighting  are  described. 
A  Supplemental  Guide  has  been  written  for  each  data  base. 
The  guide  demonstrates  profile  writing  tailored  to  the 
specific  data  base. 
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Truncation  Guide 

A  Truncation  Guide  illustrates  where  to  truncate  a 
term  in  order  to  retrieve  the  maximum  relevant  words 
with  the^.minimum  noise.     Figure  7  from  the  profile 
Truncation  Guide  demonstrates  the  retrieval  ability  of 
various  foryis  of  terms  related  to  the  concept  ANALYSIS. 

Frequency  Lists  ^  ^ 

A  Frequency  List  in  frequency  order  and  a  Frequency 
List  in  alphabetic  order  have  been  prepared  for  each  data 
base.     They  are  used  to  assist  in  selection  of  search  terms. 
A  high  frequerjcy  term  will  produce  a  high  volume  of  hits 
unless  it  is  combined  with  another  search  term  or  assigned 
a  low  weight.     A  low  frequency  word  might  be  used 
independently.     Frequency  lists  are  used  as  rough  indica- 
tors of  the  volume  of  output  one  might  expect  to  receive 
for  specific  terms.     Our  Frequency  Lists  have  been  pre- 
pared for  one  volume  of  each  data  base. 
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KLIC  Index 


*     A  KLIC  Index   (Key  Letter-In-Context  Index)  has  been 
prepared  for  each  data  base.     The  KLIC  index  is  a 
lexicographic  ordering  of  all  terms  in  a  data  base  by 
^^<^2lJi^^racter  (alpha,   numeric  or  special)    in  the  term 

Character  string.     It  is  a  permuted  arrangement  sorted 
by  character  with  the  balance  of  the  term  wrapped  around 
It.     See  Figure  8.     The  KLIC  is  used  for  linguistic 
research  and  as  a  user  aid.     By  consulting  the  KLIC  one 


'"i,^^^   "  u^^j.   axva.     ay  <j>jusuj.T:ing  Tine  x-biu  on( 

^^an  determine  the  retrieval  capability  of  a  particular 
^^^^.^-'^etter  combination  or  term  fragment.     The  KLIC  is  used 

to  Identify  letter  combinations  that  are  highly  specific 
and  would  therefore  be  discriminating  search  terms  e.g., 
*Tie  character  string  *YBD*  does  not  occur  anywhere  in 
the  CA  or  BA  data  bases  except  in  the  term  MOLYBDENUM 
(Note:     in  a  literary  data  base  it  would  occur  in  the 
mythological  character  SCYLLA  and  CHARYBDIS) .  Thus, 
*YBD*  could  be  used  as  a  search  term  .for  molybdenum. 

rQn  the  other  hand,   letter  combinations  that  occur  fre- 
q^uently  in  many"  irrelevant  terms  should  be  ^avoided 
e.g.,   the  letters  RNA  for  ribonucleic  acid  coiMd  be  used 
■^vf^r^  search  term  assuming  one  did  notCspecify  sbftultaneou? 

Nieft  and  right  truncation  *RNA*.     The  simultaneous  " 
L^runcation  mode  would  retrieve  more  than  200  irrelevant 
1^    words.     Some  of  these "are: 

^  .  ALTERNATE  *  .  ... 


BARNACLE 


CARNATION 

DIURNAL 

FINGERNAIL 

MATERNAL 

B 1  gram_  FVr  eguenc  Y_Li  s  t 

The  KLIC  Index  indicates  where  letter  combinations 
occur  and  oar  Digram  Frequency  List,  which  provides 
a  frequency  count  for  every  two  letter  combination  in 
the  data  base,   indicates  how  often  each  bigram  occurs. 
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ANIM 

AL  S  •  / 

ASPH 

/AtT 

/ 

YST 

AL  S  •  / 

ALT 

/ 

M  I  N'P  R 

AL  S  •  / 

ALT 

/ 

Of     I  T 

ALS'  / 

^>lsQH-S 

ALT 

/ 

CfTM  IC 

SULFTTS" 

-ALT 

/ 

MA^f  ! 

TP  I  COB 

alT" 

7  I  ^ 

v-P  AO  I C 

ALS*  /CA 

ACYLCOB 

ALT 

/ 

ALSACE  / 

FUSFO-S 

ALT 

/ 

/ 

A  L  S  A  C  I  E  M  '^l  F  S 

WATEP^S 

ALT 

/ 

^l.  SA"^  / 

I  RON-COB 

ALT 

/ 

AL  S  AM  I  MA.  / 

MOLTEN-S 

ALT 

/ 

V 

AL  S  AP  I  A  7 

"~     ^  AOIOCOB 

ALT' 

/  " 

S 

ALS^URY  / 

Z I NC-COn 

ALT 

/ 

F 

AL  SE  / 

AM^I NECOR 

ALT 

/  •  " 

^  A.  T 

CAT 

AL  S  E  / 

AQUEOUS -S 

ALT 

/  \ 

W 

ALSERi  / 

CPGAMOCO^ 

ALT 

/  \ 

S 

ALSES~LA  / 

SULFATE-S 

ALT 

y,A ..... 

"w 

ALSM  / 

cnpf^c^-co'^ 

ALT 

w 

A  t    C  t  J  / 

ALSHt  / 

I  0  \'  (  I  I  )  -  s 

ALT 

/ 

TH 

AL  S  I  M  I  NE  / 

^JI  CKFL-COP 

ALT 

/ 

')I  HYn'OTH 

AL  S I  Ml \F  / 

AM  NATOCOB 
A'^3  I  OE-COB 

ALT 

/  AL 

AL  S       IDES  / 

ALT 

/C 

M/D 

AL  S  IriA-^S  102- 

\'  ICKEL-CO^ 

ALT 

/I RDM- 

"alSI06  / 

~'  YR\Z  ]  N'ECOB 

ALT" 

7p 

^ER  1ST 

ALSIS  / 

L AOI UM-COR 

ALT 

/PAL 

L  I 

ALSI206  r' 

^  ANGA.NJATF/S 

ALT 

- 

 CA 

AL  S  1  / 

ATINUM-COR 

ALT 

/PL 

MA 

AL  S  I  ^0  8  / 

CME  I  DE  AfvJST 

ALT 

AS 

C  ZOCH^ 

AL  SK  I  / 

MAR  I UM-COB 

ALT 

/SA 

■  NJ  A  ^  I  F  ^ 
MICH 

Aft      ^  t>    Y           /  ^     T    #^  ^  1  1 

AL  SK  L  /R  I  SCH 

ALb\rt  / 

R I ALLYLC08 
TTR  I  tlX-Cnp 

ALT" 
ALT 

/T 
/Y 

TEMP 

ALSKI t  / 

S 

ALT. 

/ 

KATCH 

ALSKI t  / 

/  COB 

ALT(  {  I  I  )U^vEA 

AL  SK  I  •  S  / 

YANI OF ) COB 

ALT( 1)    /I SOC 

0 

ALSKY,  / 

L  T { I  I  ) /COB 

ALT( II  /CO^A 

AL  SMAN!\It"  1 

"COB 

ALT{  II  / 

A  L  S  N  S  / 

OXI  r^F-CQB 

ALT{  I  I  )  / 

AL  SO  / 

YR  \7\  NJECC^ 

ALT (  I  I  )  /P 

S 

ALSOL^^  / 

V  LIGANOCOR 

ALT(  II  )  .  / 

Aft       ^   ^>  ft        V    •  h  ^  § 

AL  SOL  I NE  / 

PTIOF  /COB 

ALT(  I  I  )-nioe 

ALSn\'  / 

AMIC   /  COB 

ALT(  I  I  ) -GLU^ 

AL  SON  r  / 

'  I  L(UR  *  /  COB 

ALT(  I  n-^'IT^ 

H 

Aft     ^  ▼  ^         f    *  4 

ALSTFDI I  / 

/COB 

ALT!  I  I 

ALSTEMt  / 

L  T (  n  /COB 

ALT(  I  I  ) /CO^A 

/ 

AL  STON^"  I  \^  » 

C^B 

ALT(  I  I )  ,  / 

/ 

Aft                               f>    rft^>^  ^ 

^L  S  TO^'r  •  '  '^T  > 

^  AMM  I  Nf:  COB 

ALTdll  /H^X 

ALST">NIA  / 

TAMMI  \'rf  OB 

ALT( I M  /Pf\ 

Al  S  Tr)\'  I  T  c  / 

COB' 

ALT{ T  II)/ 

A  L  S  1 0  P  M  V   L  I  \ 

AOUOCO^ 

M  T  ( 

III)  / 

'  G 

A  L    T  Y  A  '^J  t  / 

COR 

ALT(  I  I  I  ) .  / 

V 

Al       ^ll^AkiA  J 

A  L  S  U   ^  ^'  A  / 

IC  ALL  /C^^B 

ALT( I  II  ) -no- 

N APHT w 

^L  S ML  '     i  r  / 

OT  r  i>    /  Cl  ^  ^ 

ALT (  I  I  I  )-p-- 

M 

ALT  / 

COi^ 

ALT  (  I  I  IJ  t  / 

S 

ALT  / 

s 

ALT  ) 

1 

^LT  / 

ALT) 

1 

c-^  ^ 

ALT  / 

A\'ST 

A  L  n 

t  .-'J 

Figure  8.     KLIC  Index  Entries 
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Communication  with  User 

In  order  to  maintai-n  good  rapport  with  users  and  to  be 
sure  that  their  profiles  are  functioning  efficiently  to 
provide  the  desired  information,  CSC  uses  many  avenues  of 
communication  Vith  users.  Among  them  are: 

o    Unl\iited  profile  changes 

o    Low  cos^\profile  switch 

o  .Evaluation  r^orts 

{95%  response\^mplies  use) 

o    Feedback  cards  \y 

o    Continuous  precision  calculations 

o    Telephone  contact 

o    Comments  on  profiles  to  suggest "changes 
in  logic  weighting,   and  grouping  of 
terms,  or  to  suggest  use  of  new  data 
elements  or  new  terminology 

The  concern  for  users  is  of  extreme  importance  to 
information  centers.     Information  systems  are  designed  to 
be  used  and  if  the  clients  are  not  satisfied  with  the  service 
they  will  not  use  it. 
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REASONS  FOR  USING  THE  SDI   SERVICES  OF  AN  INFORMATION  CENTER 


There  are  some  scientists    that  have  no  need  for  or  cannot- 
use  SDI.     There  are  those  who  have  no  need  to  monitor  or  use 
the  new  developments  i^^  a  field,  those  for  whom  there  is  no 
appropriate  data  base,  'tho^e  whose  area  of  interest  is  so  ex- 
tremely narrow  and  highly  specialized  that  the  appropriate 
terminology  is  unlikely  to  be  used  by  authors  in  titles  or  by 
indexers  in  index  terms,  and  those  who  can  truly  monitor  the 
publications  in  afield  by  scanning  a  handleable  number  of  jour- 
nals.    The  members  of  this  group  are  not  nearly  so  numerous  as 
they  mignt  think. 

It  remains, then  that  there  are  many  scientists  who  do  have 
a  need  for  SDI,  and  there  are  advantages  and  reasons  for  using 
the  SDI  services  of  an  information  center.     They  are:     (1)  the 
extensiveness  and  inclusiveness  of  the  broad  coverage,    (2)  tho- 
roughness of  search;    (3)   int^rdisciplinariness;    (4)  high  recall; 
(5)  cost  effectiveness;   (6)   sjp'eed  and  regularity;   (7)  timeliness; 
(,8)  multiplicity  of  data  bases;    (9),  automatic  preparation  of 
files  in  standardized  format;  and  (10)  cost  of  data  base  pre- 
paration and  operation  of  an  SDI  system  vs.  subscriptions. 

Coverage 

The  first  and  most  obvious  reason  for  SDI,   is  that  for  the 
most  part,   it  simply  cannot  be  done  manually  any  more  because 
the  volume  of  publications  is  so  large.    While  formerly  one 
could  "eyeball"  Chemical  Abstracts  or  other  secondary  sources 
to  cover  the  literature  in  his  area  of  specialization,  this 
would  now  be  a  monumental  task.     The  journal  coverage  of  data 
bases  such  as. CAS  Condensates,  BA  Previews  and  others  is  quite 
extensive;  for  example,  in  the  field  of  chemistry  a  year's 
worth  of  Condensates  includes  300,000  chemical  references  taken 
from  approximately  20,000  journals  —  such  extensive  coverage 
cannot  be  duplicated  elsewhere.     In  the  field  of  biology,  BA 
Previews  includes  approximately  250  000.  references  selected  from 
approximately  8,000  journals.     in  engineering  the  COMPENDEX  data 
base  produced  by  Engineering  Index  includes  75,000  references 
a  year  from  3,500  journals.     There  are  many  additional  data 
bases  providing  extensive  coverage  in  their  respective  fields. 

Thoroughness  of  Search 

Machine  searching  is  more  thorough  than  human  searching. 
The  computer  will  search  every  profile  term  against  every 
citation  on  the  data  base  -  no  reference  is  overlooked  and 
human  fatigue  is  not  a  factor.     Once  the  term  matches  have  been 
made'  then  the   'hit  '  citations  are  checked  to  see  if  they  satisfy 
the  logic  the  user  employed  to  relate  the  terms  to  each  other. 
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Consistency  of  Search 

A  machine  search  evaluates  every  citation  in  the  data  base 
in  exactly  the  same  manner.     The  search  strategy  and  criteria 
for  selection  are  employed  in  the  same  way  for  the  first  citation 
on  a  tape  and  for  the  last  citation.     While  a  human  searcher  is 
likely  to  be  affected  by  fatigue  and  boredom,   the  computer  is 
not .  ' 

Inter  disc  ipl  inari-ness  • 

One  of  the  most  important  reasons  for  using  a  machine  search?* 
is  related  to  the  interdisciplinary  character  of  research.  The 
user  may  be  working  in  a  f  lejj^d  that  requires  coverage  several 
subfields  within  .a  data  base  or  several  data  bases.     For  this 
reason  it  would  be  very  d;fficult  to  identify  and  manually  search 
all  the  appropriate  portions  of  a  data  base.     It  is  also  becoming 
increasingly  difficult  to  anticipate  which  area  of  subspeciali- 
zation  a  particular  journal  article  might  be  assigned  to  within 
a  secondary  source.     For  example,   an  article  dealing  with  a 
particular  air  pollutant  might  find  its  way  into  any  one  of  a 
number  of  sections  in  CA        all  of  which  would  be  correct. 

An  organic  chemical  could  be  assigned  to  one  of  the^-organic 
chemistry  sections;   if  de^tected  by  some  analytical  device  it  could 
be  attributed  to  one  of  the  analytical  chemistry  sections;  as  an 
air  pollutant     to  the  air  pollution  section;   if  it  were  inhaled 
and  produced  a  biologic  effect  it  might  be  assigned  to  toxicology; 
if  it  were  deposited  on  a  body  of  water  it  would  apply  to  water 
pollution;  and  if  deposited  on  the  ground,  it  might  apply  to  the 
section  on  soil  and  plant  growth. 

This  IS  just  one  example  but  there  are  many  and  this  inter- 
disciplinary factor  has  been  cited  by  our  users  repeatedly  as 
being  a  real  advantage.     Many  users, who  had  manually  searched 
CA  for  years, thought  they  were  doing- a  good  job  and  thought ^they 
knew  exactly  which  sections  of  CA  would  be  appropriate,  were 
very  surprised  to  find  that  the  SDI  search  located  relevant 
references  from  sections  they  would  never  have  examined. 

High  Recall  n 

Because  of  the  previously  mentioned  advantages  it  is  possible 
for  the  user  to  achieve  a  higher  recall  with  SDI  than  he  could 
manually.     Naturally,  the  value  of  high  recall  vs.  high  pre- 
cision varies  from  user  to  user  depending  on  his  objective  in 
using  the  service.     The  user  or  the  profile  coordinat6r  acting 
for  the  user    must  weigh  the  tradeoffs  and  determine  whether 
he  can  afford  to  miss  a  few  relevant  references  in  order  to 
achieve  high  precision  or  whether  he  can  afford  to  retrieve 
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some  irrelevant  references  in  order  to  ensure  his  not  missing 
anything. 


icall 


An  example  of  £he  recall  ability  of  the  system  is  the  case 
of  one  of  our  user  compai^es  that  maintained  a  manual  search 
system  parallel  to  the  SdV^s  provided  and  compared  the  output 
for  over  a  year.     For  many  yfears  and  throughout  the  experimental 
year  20  bench  chemists  divided  up  the  sections  of  CA  and  searched 
for  references  relevant  to  the  companies  areas  of  research.  The 
output  of  their  search  was  forwarded  to  their  technical  library 
as  was  the  output  of  our  SDI.     Eleven  profiles  were  written  and 
run  against  Condensates.     The  results  of  the  study  was  that 
the  manual  search  identified  5%  more  relevant  references  than 
the  machine  and  the  machine  identified  15%  the  chemists  missed. 
Simply,   if  total  relevant  references  identified  by  both  sources 
is  considered  100%  then  the  recall  for  the  manual  search  by 
professional 'chemists  was  87.5%  and  the  recall  of  the  SDI  system 
was  95.8%.     Naturally,   the  SDI  produced  about  60%  non-relevant 
citations  but  the  time  req;uired  to  evaluate  and  reject  these 
was  not 'signifi<:ant .     The  truly  significant  factor  is  the 
economic  one. 

Cost -Effectiveness  •  ^ 

The  value  of  an  SDI  system  can  be  measured  in  terms  of  time  - 
saved.     There  are  many  other  valuer  but  cost  effectiveness  is 
the  criteria  that  is  most  often  applied  by  the  subscriber.  Not 
all  cases  are  so  dran\^tic,  but  in  the  example  cited  above  the 
cost  of  the  manual  search  using  average  rates  of  $20,000/man 
year  was  $87,000  whereas  the  cost  for  the  eleven  profiles  was 
$2,800  (or  $4,500  using  our  current  price  schedule  where  sub- 
scription rates  are  related  to  number  of  profile  terms). 

An  American  Chemical  Society  survey  reported  in  Chemical 
and  Engineering  News  47:3;,  July  28,   1969  that  the  average 
industrial  chemist  sp^ruls  11.8  hours  per  week  in  current 
awareness  and  literature  searching,     of  the  7.5  hours  spent 
on  current  awareness  SDI  effected  an  average  savings  of  3.1 
hours  £o^  every  hour  spent.     Assuming  an  expense  to  the  company 
of  $15.00/hours  the  savings  over  a  year  would  be  almost  $2,500. 
Perhaps  much  ^  the  time  would  have  been  spent  in  off  hours 
and  would  not  n^ve  netted  the  company  additional  productive 
man  hours.     However;:,  conservatively  speaking,  a  rule  of^thumb 
might  be  tha^  if  theNjJser  saves  only  one  hour  per  week,  at 
$15.\pO/hours  the  cCst  t^lQQ, 00  which  is  considerably  more 
than\the  average  cost  of  $2'B0/year  fpr  an  SDI  profile. 
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Speed  and' Regularity 

A  machine  search  is  done  very  rapidly  and  the  search  for 
each  user  is  done  each  week  on  a  regular  basis.     The  machine 
does  not  have  time  off  for  illness  and  vacation.     The  human 
searcher  on  the  other  hand,  may  become  burdened  with  other- 
tasks  and  may  not  have  time  to  go  to  the  library'  each  week. 
Even  in  cases  where  the  users  area  of  research  is  narrow 
enough  and  well  defined  so  that  he  can  res^dily  do  his  own 
current  awareness  of  CA  by  searching  a  limited  and  manageable 
number  of  sections  the  question  remains,  will  he  do  it  and 
will  he  do  it  on  a  regular  basis?     If  the  library  is  located 
some  distance  from  his  office  or  if  he  is  one  of  many  users 
on  a  long  distribution  list,   he  is  unlikely  to  read  the  current 
issue  when  it  is  published.     SDI  provides  him  his  output  on  a 
regular  basi^  regardless  of  these  circumstances.     At  IITRI,  we 
have  been  presiding  SDI  from  CA  Condensates  on  a  production 
basis  for  two  years  and  we  have  never  missed  a  weekly  run  due 
to  anything  related  to  our  system.     (  In  one  instance  the 
supplier  was  late  in  producing  the  tape  causing  a  delay  in 
the  search) . 

Timeliness  f 

The  magnetic  tape  version  of  a  secondary  source  is  usually 
made  available  prior  to  the  publication  of  the  hard  copy.  At 
IITRI  the  Condensates  search  tape  is  received  1-2  weeks  prior 
to  publication  of  the  Chemical  Abstracts.     The  tape  is  held  for 
a  week  to  provide  a  backup  copy  in  case  any  issue  is  deleted  in 
production  or  in  the  U.S.  mail.     The  search  is  conducted  and 
output  received  by  users  approximately  1-2  days  prior  to  receipt' 
of  the  hard  copy  in  their  library.     This  enables  the  user  to 
check  abstr^acts  at  the  time  he  reviews  his  output  citation  card's. 

Multiplicity  of  Data  Bases 

Many  organizations  require  journal  coverage  from  a  variety 
of  '^sources  in  widely  divergent  disciplines.     They  need  to  use 
several  data  bases  or  secondary  sources  and  if  this  is  done, 
in-house  they  must  be  aware  of  the  coverage,   terminology  and 
indexing  practices  of  the  different  suppliers.     And,  as  changes 
in  the  data  base  occur  they  must  be  accommodated  by  the  ^(^Irch 
in  preparing  search  terms  and  strategies. 

Automatic  Preparation  of  Files  in^andardized  Format 

The  output  of  an  SDI  search  is  cita^^acis  that  are  printed 
in  a  standardized  format.     In  cases,   suefl|^s  our  system  at 
IITRI,  the  citations  are  printed,  together* with  index  terms 
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and  hit  terms,  on  cards  thus  providing  a  unit  record  file  from 
which  irrelevant  or  obsolete  material  can  be  deleted.  In 
contrast  with  printouts  on  computer  paper  the  citations  can 
,   be  sorted  and  filed  according  to  the  user's  preference  and  it 
is  not  necessary  to  retain  irrelevant  citations.     The  user's 
file  can  be  purged  and  rearranged  as  needed. 

Even  in  erases  where  a  user  can  search  the  current  issues 
of  CA  or  another  data  base  it  might  be  worth  it  to  subscribe  to 
SDI  solely  for  the  convenience  of  having  the  appropriate  re- 
ferences printed  on  cards  m  standard  format.  Few  scientists 
relish  copying  citations  out  of  CA  nor  is  it  always  ponvenient 
for  a  secretary  to  go  to  the  library  to  type  them.  ' 

Cost  of  Data  Base  Preparation  and  Operation  of  an  SDI  System 
vs.  Subscriptions 

Cost  of  Data  Base  Preparation 

The  cost  of •  preparing  a  data  ba^e,  which  is, borne  by 
the  suppliers  and  incurred  only  once,   is  high,   t^^^  infor- 
^  mation  center  and  subsequently  the  subscriber  pays  only  a 

small  fraction  of  the  cost. 

The  Condensates  d^ta  base  covers  approximately  20,000 
primary  journals,   includes  more  than  25,000  new  citations 
•    per  month,  and  the  cost  of  preparing  this  data  base  in 
machine  readable  form  is  very  high.     It  is  unlikely  that 
an  individual  company  or  other  user  organization  could 
afford    to  select,   abstract,  edit,   index,  keypunch,  compose, 
print  and  otherwise^  prepare  such  a  collection.     Even  pre- 
paring a  data  base  from  the  journals  known  to  be  of  interest 
to  an  individual  company  would  probably  cost  much  more  than 
the  total  subscription  fees  required  to  provide  SDI  from 
the  same  sources.  *• 


Clearly,  data  base  preparation  is  an  expensive  task 
and  user  organizations  would  not  want  to  duplicate  the 
efforts  of  the  data  base  suppliers.     The  next  alternative 
is  that  of  processing  data  bases  in-house  and  operating 
an  SDI  system.'    This  too,   is  more  expensive  than  onei^night 
anticipate.     The  mere  purchase  or  lease  of  a  data  base  is 
not  all  that  is  required  to  run  an^'SDI  system.     In  fact, 
the  costs  associated  with  obtaining  a  data  base  are 
extremely  small  relet ivG  to  the  balance  of  the  expenses. 

4 

Cost  of  Operation  of  an  SDI  System 

Operating  costs  can  be  considered        four  in  kind: 
data  base;  materials,  equipment  and  furniture;  machine 
time;  and  personnel.     Details  for  some  of  the  specific 
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cost  Items  are  given  in  Figure  9. 


Data  Base  Related  Costs: 

Purchase  or  lease  of  data  base 

Royalty  payment  to  data  base  supplier  for  all  hits 
or  citations  disseminated 

Materials,    Equipment  and  Furniture; 

Purchase  or  lease  of  keypunch  and  terminals  if  needed 

Purchase  of  expendable  materials  e.g.  cards,  paper 
products,  postage,  office  supplies  and  various 
out-of-pocket  expenditures  including  'travel 

F^urniture  e.g.:     special  file  cabinets  to  hold  tapes, 
printouts,   listings,  etc.,   and  standard  office 
furniture. 

Machine  Time; 

Production : 


Reformat  data  base 
Edit  prof  lies 

Prepare  profile  input  for  search 
Search 

Sort  and  format  output 
Print 
Statistics: 

Tape  library  maintenance 
Research  and  Development; 

Personnel  Time : 

Management 

Marketing 

Systems  design 

Programming 

Profile  maintenance 

Keypunch 

Clerical 
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The  efficient  operation  of  a  center  requires  (1)  A 
management  component  to  direct  and  oversee  all  production, 
research  and  development  activities;    (2)  A  marketing  com- 
ponent to  develop  a  market  and  ensure  use  of  the  center 
Ci)  A  systems  designer    is  required  where  software  systems 
are  to  be  designed  in  house^-this  type  of  staff  is  not 
essential  where  available'  software  is  used  (4)  Programming 
capabilities  are  required  to:     (a)  develop  software 
(b)  modify  existing  software  for  internal  use,   (c)  change 
and  improve  software     (d)  adapt  to  changes  in  operating 
system,  compiler  and  configuration  and  (e)  adapt  to  changes 
imposed  on  the  center  by  data  base  suppliers  such  as  new 
data  el^ents,  changes  in  data  base  format,  changes  in 
record^g>rmat ,  change's  in  machine  code,  changes  in  storage 
densiti^,   etc.    (5)  A  subject  specialist  is  needed  to 
handle  profiles  and  maintain  liaison  with  users. 

Profile  coordination  involves  writing  profiles; 
monitoring  profile  performance,  output  and  user  response; 
keeping  abreast  of  data  base  changes  such  as  indexing 
procedures,  vocabulary,  subject  and  jpurnal  coverages 
etc.;  updating  and  modifying  profiles  in  response  to 
changes  in  user  interest  or  in  data  bases;  and  maintaining 
user  records  with  respect  to  precision  values,  number  of 
hits  and  any  other  type  of  statistic  the  center  may -wish 
to  maintain. 

The  data  base  expenditures  and  materials  are  the 
smallest  items  in  a  center  s  budget.     In  our  own  case  they 
amount  to  less  than  five  percent  of  our  expenses.  Computer 
time  and  personnel  time  are  the  major  cost  el'ements.  Some 
people  have  been  rushed  into  thinking  that  for  the  re- 
latively minimal  investment  of   $5,000  to  $10,000  for 
acquisition  of  data  bases  they  can  process  tapes  in-house 
or  operate  centers.     This  is  a  very  unrealistic  outlook. 
A  rule  of  thumb  might  be  that  efficient  processing  of  a 
data  base  would  require  at  least  100-200  users  and  an 
information  center  needs  about  $100,000  per  year  to 
operate.     This  is  a  liroad  generalization,   since  I  fully 
realize  that  software  efflciences  and  personnel  require- 
ments will  vary  among  centers  depending  on  the  overall 
system    computer  installation,  quality  of  profiles,  re- 
quirements of  users  and  services  provided.     Bear  in  mind 
that  these  costs  assume  that  software  is  already  avail- 
able and  need  not  be  developed. 

Cost  of  Subscription 

The  expense  involved  in  purchasing  subscriptions  to 
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the  SDI  services  of  information  centers  is  by  far  the 
most  economical  way  of  providing  SDI  to  the  average 
company.     Individual  profiles  range  in  cc^st  from 
approximately  $100  to  $500  per  year  depending  on  the 
center,  the  data  base,  the  number  of  terms  and  the 
volume  of  output.     The  average  company  can  purchase 
many  subscriptions  to  many  data  bases  for  a  small 
fraction  of  the  cost  required  to  process  in-house. 
Use  of  existing  centers  provides  the  user  organi- 
zation with  far  more  flexibility  in  terms  of  varieties 
of  data  bases,  types  of  services,  etc.  than  could  be 
experienced  in-house.     The  user  company  has  no  commit- 
ment to  a  hardware  configuration  or  software  package, 
nor  does  he  have  a  payroll  to  meet. 

The  number  of  centers  in  operation  in  the  U.S. 
today  IS  more  than  adequate  to  meet^  the  needs  of  the 
limited  market.     They  and  their  sponsors  have  borne 
the  cost  and  he a da c he s  as soc i a ted  with  des ign  and 
development,   and  they  are  now  re^dy  to  share  the 
fruits  of  these  efforts  by  providing  service  to 
industry,  academic  institutions  and  government 
facilities. '  For  the  most  part,  .development  costs 
are  not  passed  on  to  users  in  subscription  fees 
only  operational  costs        so  it  behooves  members  of 
the  scientific  community  to  take  advantage  of  the 
investment . 


\ 


IIT    RESEARCH  INSTITUTE 


A  NEW  APPROACH  TO  COMPUTER  ASSISTED  DESIGN  - 
OF  ORGANIC  SYNTHESES 

« 

W.  Todd  Wipke 
Department  of  Chemistry 
Princeton  University 
Princeton,  New  Jersey  085^0 

"In  other  words,  automated  planning  of  synthetic  routes  could  en- 
hance progress  by  five-fcfld  and  develop  the  state  of  the  art  to  the  same 
high  level  achieved  in  the  elucidation  of  structure.     It  seems  quite 
reasonable  to  anticipate  tha^  computer  automation  will  eventually  come 
to  syntheti-c  chemistry  just  as  new  instrumentation  has  come  to  structural 
problems*^  ^ 

To  be  perfectly  fair,  it  must  be  admitted  that  such  automation 
cannot  be  expected  to  materialize  soon.     The  prodigious  problem  of  pro- 
gramming stands  as  a  barrier  which  will  be  assailed  only  where  the  in- 
centive is  equally  great, 

• 

This  paper  will  higlight  the  important jpspects  of  this  "prodigious 
problem  of  programming"  paying  particular  attention  to  the  issue  of 
representation.     Part  of  the  material  presented  here  is  drawn  from  the 
first  synthesis  program,  OCSS,  by  the  author  and  E.J.  Corey. ^  part  from 
LiiASA,  which  is  OCSS  with  a  considerably  extended  chemical  module  by  the 
author,  E.Ji  Corey.  R.D.  Cramer, III,  and  W.J.  Howe.^    The  remainder 
pertains  to  the  new  synthesis  program  being  developed  at  Princeton  by  ' 
the  author.  T.M.  Dyott,  and  P.  Gund. 

A  View  of  the  Problem 

Designing  a  synthesis  is  a  challenge  fo^  a  chernist;-be;causc  it 
focuses  all  of  organic  chemistry  toward  the  solution  i'f  one  problem. 
In  the  same  way,  it  is  a  challenge  to  te^ch  a, computer  how  to  plan 
syntheses,  because  it  brings  to  bear  not  only  the  field  of  organic 
chemistry,  but  also  that  of  computer  science  toward  the  solution  of  one 
central  problem, 

By  building  such  a  program'we  gain  problem  solving* capability,  we 
gain  understanding  of  our  owrt  problem  solving  processes  in  general  and 
that  of  synthetic  analysis  in  particular,  and  perhaps  most  important, 
we  gain  new  ideas  for  labotatory  research.    Anyone  who  has  programmed 
a  computer  knows  that  in  programming  a  solution  to  a  problem,  one  de- 
velops a  complete  understanding  of  the  problem.     In  removing  the  rnys- 
teries  from  synthesis  we  must.be  careful  not  to  simply  transfer  them  to 
the  atea  of  computer  science.     That  is.  we  must  al^o  remove  the  mys- 
teries of  programming  comple^  problem  solvers. 

Synthesis  can  be  viewed  from  several  different  directions  depending 
in  part  on  the  motivation  fon  the  synthesis.     The  first 'type  might  be 
characterized  as  an  industrial  point  of  vjow,  ^iven  A,  what  can  be  made 
from  ii£      t]>o>.gi ven  range  of  reagents  in  a  few  steps.     By  necessity 
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Type  I       A   ■ — ^  ? 

Type  II      A   ^ — ^  T, 

Type  III    ?   ^— T 

this  problem  requires  working  forward.    A  type  II  synthesis  is  one 
characteristic  of  very  complex  molecules  such  as  progesterone.  Here 
the  target  T  is  given  and  a  similar  complex  available  material  A  is 
chosen  as  starting  material.     The  problem  is  to  find  the  interconnect- 
ing paths  between  them.     Finally  there  is  tfie  type  III  synthesis^ 
characteristic  of  total  syntheses.     The  target  is  specified^  but  there 
is  no  prejudice  as  to  reactions  or  starting  materials.    Of  course  this 
problem  requires  working  backward  for  a  solution.  ^  We  are  currently 
most  concerned  with  this  lattei;  approach,  but  methods  developed  for 
the  type  III  will  have  some  applicability  for  the  other  types  as  well. 

The' general  theory  of  synthetic  analysis  as  applied  to  complex 
molecules  has  been  previously  discussed^  and  a  particularly  simple 
yet  powerful  technique  for  the  design  of  organic  syntheses  has  been  termed 
the  "logic-centered"  approach.     In  its  "pure"  form  the  method  starts 
with  the  synthetic  objective  or  "target  molecule"^  T,  and  by  synthetic 
inference  derives  the  set  of  precursor  molecules  which  can  be  expected 
to  be  converted  to  the  target  molecule  by  one  synthetic  step  or  a  se- 
quence of  simple  steps.     Repetition  of  this  process  treating  each 
precursor  as  a  new  target  generates  the  familiar  "tree"  of  synthetic 
intermediates..    An  important  goal  of  the  "logic-centered"  approach  is 
to  lead  to  precursors  which  are  simpler  than  the  target.    The  analysis 
terminates  upon  production  of  precursors  which  a^e  acceptable  as.  Starting 
materials'. 

The  "logic-centered"  approach  while  it  lacks  the  exciting  intuitive 
insights  of  the  chemist  is  still  quite  powerful  and  is  well  suited  for 
computer  execution.     The  first  synthesis  program,  OCSS  (Organic  Chemical 
Synthesis  Stimulation)^  used  these  principles.     The  validity  of  the  syn- 
-  theti^  tree  produced  by  OCSS  is  a  true  reflection  of  the  effectiveness 
of  the  logic-centered  principles,  for  a  computer  cannot  as  yet  rely  on 
"prior  synthetic  experience." 

The  Computer's  Role  is  Obvious,  or  is  it? 

It  sounds  so  simple «  "feed  the  computer  all  known  reactions  and  let 
it  apply  them  in  every  possible  way."    This  brute-force  approach  fails 
even  with  the  biggest  computer,  because  the  number  of  possible  branches^ 
in  the  synthesis  tree  is  so  large.     It  can  be  argued  that  synthesis  is 
more  complex  than  chess  and  it  has  been  estimated  that  there  a^v  lO*^"^^ 
different  paths  in  a  complete  chess  maze.'^    Clearly  the  brute-force 
method  won' t  do. 


5.S4 


10-62 


Ihe  clue  to  intelligent  behavior  is  selective  search  guided  by 
heuristic  rules.     One  such  rule  has  already  been  mentioned--the  striving 
for  simplicity.     Thus  transformations  which  do  not  ultimately  simplify 
the  target  are  pruned  as  early  as  possible.     Good  chemical'  heuristics- 
minimize  the  ever  present  risk  of  pruning  away  good  solutions. 

In  theory  the  computer  could  solve  the  entire  synthesis  problem 
without  human  intervention,  but  in  practice  we  have  insufficient  know- 
ledge of  chemical  reactivity  and  of  proper  evaluation  criteria  tp  permit 
a  closed  solution.     To  close  the  loop  we  use  a  chfemist  to  evaluate 
results  and  guide  further  search.     By  saving  some  of  our  human  chemical 
input  for  execution  time,  we  gain  flexibility  and  power  to  meet  situations 
unanticipated  during  the  programming.     Programming  the  closed  solution  is 
hampered  also  because  of  the  difficulties  in  fitting  such  large  problems 
to  current  computer  memories  and  of  organizing  and  controlling  it  oncQ 
there,     fhe  chemist-computer  combination  for  now  appears  to  be  the  best 
arrangement.  • 


The  General  Scheme 


The  elementary  problem  solving  scheme  shown  below  is  applicable 
to  the  problem  of  synthesis  regardless  of  direction  of  analysis,  forward 
or  backward. 

1.  Communication  of  problem 

2.  Perception  of  problem 

3-   Selection  of  chemical  operators 

h.  Creation  of  sub-problems  (precursors  or  products) 
5-  Evaluation  and  output  of  result,  Quit  if  done 
6.  Selection  of  next  sub-problem  and  go  to  2. 


Fortunately,  the  above  functions  interact  infrequently  and  in 
well  defined  ways.     This  enables  a  modular  program  design  and  the  use 
of  overlay  structure  to  increase  the  effective  size  of  the  computer. 
The  program  being  developed  at  Princeton  embodies  a  new  approach  of 
giving  the  computer  a  molecular  model  to  which  it  may  refer  in  its 
analysis.     Thus  where  the  OCSS  family  of  programs  had  five  modules,  there 
are  now  six  modules: 

1 .  Graphical  communication 

2.  Strategy  and  Control 
5-  Model  builder 

h.  Perception  of  structural  features 

5.  Symbolic  structural  manipulation 

6.  Evaluation  of  precursors  -  ' 

Several  of  these  modules  will  nqv-be  described  to  poT"tray  in  a 
gPHPral  way  the  form  of  information  and  processing  involved  and  t^e 
significance  of  the  form.  ^ 
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Graphical  Conimunication 

"The  graphical  communication  module  lets  the  chemist-computer  team 
converse  freely  in  terms  of  standard  structural  diagrams.  u^OCSS'pio- 
neered  the  tide  of  a  two-dimensional  Rand  Tablet  and  cathode  ray  tube(CRT) 
for  this  purpose.     Recently  we  reported  preliminary  work  using  an 
acoustic  three-dimensional  tablet  for  entry  of  chemical  structures  into 
a  computer  and  an  Evans  and  .Sutherland  display  system  for  stereo  viewing 
of  the  structures.^    The  acoustic  tablet  consists  of  three  strip  micro- 
phones and  a  pen  which  emits  small  sparks.    With  this  equipment  which 
was  developed  at  Princeton,  the  program  can  sense  the  pen  X,Y,Z  coordi- 
nates and  the  status  of  a  button  on,  the  side  of  the  pen  which  signifies 
the  be'ginning  of  a  bond  or  a  pointing  action.     The  current  graphics 
module  is  written  using  a  new  FORTRAN  General  Interactive  Graphics  Lan- 
guage^(GIGL)  and  permits  entering  two-dimensional  structural  diagrams, 
freehand  three-dimensional  drawings,  or  a  relatively  accurate  molecular 
model  traced  from  an  actual  physical  model  of  the  Fieser  or  Dreiding  type. 

By  using  graphical  communication,  debugging  is  simplified,  the 
language  and  nomenclature  problems  are  avoided,  and  the  program  becomes 
palatable  to  organic  chemists  who  now  can  understand  the  input  and  output 
without  learning  codes  or  in  the  case  gf  J-D;  without  having  to  supply 
atomic  coordinates.     The  internal  representation  which  the  chemist  never 
sees  is  a  rather  standard  connection  table  which  forms  the  fundamental 
structural  representation. ^ 


Model  Building 

This  module  must  construct  a  fairly  accUrata  three-dimensional 
model  of  the  target  molecule  an4  subsequent  precursors  if  requested, 
regardless  of  the  type  of  structure,  with  or  without  initial  atomic 
coordinates  and  in  a  minimum  amout  of  time.     The  approach  taken  in- 
corporates heuristic  classical-mechanical  constraint  satisfaction  tech- 
niques'^ to  derive  a  model  of  minimum  "strain  energy."    It  relies  on 
the  chemist  to  guide  it  to  the  desired  conformation  of  the  molecule, 
which  he  can  easily  do  using  the  acoustic  tablet  in  J-D.  Experience 
with  the  program  indicates  that  it  can  usually  build  a  reasonable  model 
given  only  a  connectivity  table  or  even  given  random  coordinates-- 
showing  that  it  is  good  at  avoiding  false  minima.    Details  of  this 
program  module  will  be  presented  in  a  later  paper. 

Perception  of  Chemical  Features 

the  perception  module,  the  eye  o^f  the  program,  takes  the  raw  connec- 
tion table  created  by  the  graphics* module  and  digests  it  into  more  useable 
and  recognizable  forms.     This  processing  is  carried  out  on  a  variety 
of  levels  from  primitive  to  complex,  creating  new  data  structures 
appropriate  to  the  complexity  of  the  perception.     On  the  lowest  level 
information  is  simply  transferred  from  the  connection  table  into  sets. 
A  set  is  a  contiguous  string  of  binary  digits  (bits).     If  the  Uh 
bit  in  the  set  is  a  "l",  then  the  i^th  object  is  a  member  of  that  set. 
Examples  are  given  below.    Although  sets  can  store  only  binary  informa- 


e 


-  ioioooolo 

Set 

of  atoms  alpha  to  atom  2 

SADA(5,2) 

-  000010000 

Set 

of  ^toms  degree  5  away  from  2 

SAASA(SAAA(2)') 

=  mioioii 

Set 

atoms  a  to  a  set  of  atom  2 

OXYGEN 

=  000000110 

Set 

of  oxygen  atoms 

B0ND2SET 

-  000001100 

Set 

of  doubly  bonded  atoms 

AND (OXYGEN, B0ND2SET) 

-  000000100 

Set 

of  doubly  bonded  oxygen  atoms 

tion,  the  combination  of  sets  by  logical  operators  provides  a  simple 
means  for  expressing  complex  processing.     Further,  digital  computers 
process  bit  strings  in  parallel,  52  or  56  bits  at  a  time,  making  set  > 
operations  th^  most  efficient  processing  available.    A  complete  set  of 
FORTRAN  routines  provides  every  conceivable  set  operation. 

On  the  next  level  of  complexity,  functional  groups  are  recognized. 
Now  wc  need  to  store  the  type  of  functional  group,  and  how  the  "standard 
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form"  of  the  group  maps  onto  the  target  molecule,  that  is,  the  origin 
of  the  group  and  its  constituent  atoms  and  bonds.    This  requires  a  new 
data  structure  called  a  list  or  linked-list.  \  A  list  is  composed  of  cells 
which  come  from  and  are  returned  to  a  common  ^storage  pool  of  cells. 
The  "head"  of  the  cell  contains  a  datura  or  a  pointer  to  a  sub  list  and  the 
"tail"  contains  a  pointer  to  the  next  cell  on  the  list  or  an  "end"  code. 
The  groups  in  the  previous  example  would  be  represented  as  below,  (The 
box  represents  the  hea^d  of  the  cell  which  contains  packed  data.  ) 


GROUPS 
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Each  sub list  is  a  grcAip  which  has  a  type  and  a  group  sequence  number. 
Cells  following  contain  a  bond-atom  pair  except  for  the  first  cell. 
The  chemist's  origin  of  the  group  is  usually  the  second  cell  of  the 
group  (the  third  cell  of  the  sublist).    Lists  are  useful  for  storing 
data  which  is  of  variable  length  and  structure  such  as  groups  and  rings, 
For  the  flexibility,  we  pay  a  penalty  of  a  50^  increase  in  consumption 
of  storage  space  and  processing  time. 

The  perception  continues,  building  on  lower  level  perceptions,  to 
recognize  and  represent  more  complicated  groupings  such, as  aromatic 
rings  and  groups  extended  by  congugati^'on  with  double  bonds*  Lastly 
perception  utilizes  its  molecular  model  ajid  derives  chemically  signif- 
icant features  such  as  close  spatial  arrangement  of  two  groups. 

Control  and  Strategy  Module 

Preliminary  decisions  on  what  general  goals  should  be  sought  are 
made  by  the  strategy  module.     Using  the  perceived  i^ormation  and  any 
global  information,  it  attempts  to  determine  what  the  most  im^rtant 
pr'oblem  is,  eg^,  to  change  foreign    ring  systems  into  familiar  ones,  to 
ch'knge  sensitive  groupings  into  more  stable  ones,  etc.     It  also  keeps 
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track  of  the  precursors  created  by  subgoals  so  they  can  be  operated  upon 
by  the  higher  level  goals.     The  design  of  the  goal  oriented  executive 
was  based  on  early  artificial  intelligence  research  in  chessf  and  geometry 
theorem  proving.®    The  generality  of  this  technique  permitted  major 
changes  in  the  synthetic  strategies  and  processing  with  little  or  no 
change  in  the  executive. 

T'ransform  Selection  or  "What  to  Do" 

Everything  prior  to  this  has  been  preparatory  for  this  stage  of 
processing.    The  problem  is  "how  do  we  choose  what  chemical  transform  to 
use?"    Each  chemical  reaction  can  be  characterized  by  the  pattern  of 
functional  groups  or  other  structural  features  it  leaves  in  the  product. 
Thus  we  organize  the  reactions  by  special  features  or  relationships 
found  typically  ii^  the  products  of  the  reaction.    For  example,  the  Aldol 
reaction  leaves  a  1,3  hydroxy -ketone  or  1,3    double  bond-ketone  in  the 
product.     The  first  step  then  in  answering  this  question  is  to  make  ap- 
propriate associations  between  target  molecule  and  chemical  reactions. 

Functional  groups  are  assigned  to  classes  according  to  functions  they 
perform,  e^,  CYANO  group  is  also  an  electron  withdrawing  group  (WGROUP). 
Two  groups  are  selected  and  the  relationships  between  them  are  determined 
as  the  number  of  bonds  separating  them,  stereochemical  relationships 
or  even  spatial    relationships.    Each  relationship  with  all  possible 
pairings  of  class  names  is  matched  against  the  associative  memory. 
(This  is  a  simplification,  there  are  several  associative  tables.)  A 
match  means  the  relationship  may  be  chemically  significant.     So  far  the 
search  has  been  target-driven. 

When  a  match  is  found  in  the  associative  memory,  an  interpreter 
takes  over  in  a  table -driven  search  to  evaluate  the  environment  of  the 
grbup3  and  the  path  between  them.     The  interpreter  examines  coded 
entries  in  a  chemical  table  which  cause  it  to  determine  the  presence  or 
absence  of  certain  features  and  increase  or  decrease  the  rating  of 
that  transform  accordingly.     The  idea  for  this  target-driven:table- 
driven  combination  stems  from  the  design  of  the  efficient  functional 
group  recognizer  in  OCSS.^    This  combination  only  requires  storage  of 
the  key  structural  features  recognized  by  the  general  perception  module, 
and  leaves  detailed  perception  to  the  environment  evaluating  interpreter 
as  needed  to  answer  specific  queries  in  the  table.     It  would  be  im- 
practical and  undesirable  to  attempt  to  anticipate  all  possible  queries 
by  perceiving  all  possible  relationships. 

But  perhaps  the  most  important  feature  of  this  method,  and  in  fact 
the  reason  it  was  selected,  is  the  language  used  to  create  the  coded 
tables.     This  language  was  developed  for  OCSS  in  1969  and  since  then 
has  undergone  considerable  extension.    A  simple  associative  entry  for  the 
Michael  reaction  is  written  in  an  early  dialect  of  the  language  to 
demonstrate  the  readability  of  the  statementls. 
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A  key  concept  in  organic  chemistry  is  that  bonds  are  made  along 
a  linear  path  on  the  structure  although  the  patli  may  be  cyclic.  Thus 
when  the  program  finds  a  pair  of  groups  and  the  path  between  them,  it 
has  already  established  a  reference  system  for  describing  and  executing 
a  transformation  involving  those  groups.    A  canonical  ordering  of  the 
groups  determines  where  the  numbering  starts. 

0                           0  0  0 

H    ^_  II   >     I  II 

Ci-C2^C3  +     C4— Cs    <  CrC2-C3^C4-C5 

Michael  Reaction.     Single  arrow  is  the  synthetic  direc- 
tion; double  arrow  is  the  analytic  direction.     The  partial 
associative  entry  is  shown  below. 

0X0  0X0  5 
\ PRIORITY  50 
^SUBT  50  IF  WGROUP  CBN5 

SUBT  10  FOREACH  RGRP  ALPHATO  CBNJ  OFFPATH 
KILL  IF  DBOND  ANYWHERE  ONPATH 


CONDN  SLALK 

BREAK  B0ND3  ^ 
MAKE  B0ND2 

When  a  match  is  found  in  the  associative  memory,  the  matched  trans- 
formation is  given  a  base  priority  for  execution.     The  environment  of 
the  groups  is  checked  by  qualification  statements  in  the  associative 
entry.     In  the  example,  the  presence  of  an  electron  withdrawing  group 
on  C3  on  the  path  lowers  the  priority  of  the  transform  by  30.    The  pri- 
ority of  the  transform  is  lowered  further  by  10  for  each  alkyl  group 
attached  to  C3  not  on  the  path.    Finally,  the  transformation  is  dropped 
from  consideration  if  there -is  a  double  bond  apywhere  on  tl^e  path 
C1-C5.     If  there  are  groups  sensitive  to  slightly  alkaline  conditions, 
the  conditions  of  this  reaction,  then  the  priority  is  further  lowered. 
The  last  two  entries  direct  the  program  how  to  perform  the  transformation 
in  the  analytic  direction.     The  transforms  are  executed  in  the  order 
of  their  final  priorities  with  a  minimum  priority  cut-off. 

Evaluation 

Machine  evaluation  of  precursors  has  so  far  only  been  concerned 
with  the  valence  of  atoms,  electronic  stability,  implausible  bonding, 
and  in  a  crude  way  with  precursor  simplicity^^    The  mol&cular  model 
will  improve  the  evaluation  of  strain  in  the  structure.    The  missing 
criterion,  which  the  chemist  has  supplied,  is  the  liklihood  .of  the 
forward  reaction  occurring  as  predicted.    A  forward  working  chemistry 
module  could  certainly  help  fill  this  gap. 
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Remaining  Problems 

The  OCSS  program  has  been  quite  successful  in  the  design  of 
syntheses  of  molecules  containing  relatively  few  interacting  functional 
groups.    A  major  problem  is  the  ordering  of  operations  on  multifunctional 
compounds.    In  part  this  is  a  result  of  the  difficulty  in  predicting 
relative  reactivity  of  complicated  functional  groups. 

Another  problem  is  that  of  writing  machine  independent  programs* 
Our  current ^projectT^^-using  ASA  FORTRAN  IV  only  because  it  is  universal. 
The  language  is  poor  for  symbolic  processing.    We  have  however  through 
the  "plex"  programraina'  technique    managed  to  make  the  program  indepen- 
dent of  the  exact  macnine  representation  of  the  data  even  though  the 
data  is  packed.    Ihus  transfer  from  one  machine  to  another  requires 
rewriting  only  a  small  machine  language  interface* 

Conclusions 

The  future  of  this  problem  solving  method  appears  bright.  The 
new  approach  of  having  a  5"D  model  may  open  the  areas  of  stereo-select- 
ivity, steric  hindrance  and  proximity  effects.    One  can  envision  giving 
the  program  an  electronic  model  as  well  with  which  to  evaluate  charge 
distribution  and  orbital  symmetry.    Certainly  the  power  of  the  programs 
will  increase  as  its  factual  chemistry  base  increases  and  as  chemical 
heuristics  are  developed.    While  we  have  not  surmounted  the  "barrier," 
it  appears  we  are  climbing  in  the  right  direction. 
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Computer-liased  Searching  of  the  Current  Chemical 
Literature  by  tne  Individual  Scientist 

Ljerhard  Kiehlmann, 
Simon  Fraser  University,  BURNABY  2,  B-C»,  Canada 

> 

(Experiences  Gained  After  Two  Years  of  Participation  in  the  CAiVSDI 
Project,  National  Science  Library,  iJational  Research  Council  of 
Canada) 


At  present,  some  300,000  papers  of  ciiemical  interest,  pub- 
Jiisned  in  12,000  journals,  2d  countries  and  17  languages,  are  ab- 
stracted axmually  by  Chemical  Abstracts*    Selective  Dissemination 
of  Information  (SDI),  i.e.,  a  computer-based  pre-selection  of  arti- 
cles specifically  related  to  the  individual  scientist's  research  in- 
terest from  this  enormous  number  of  source  documents,  appears  to  be 
the  most  efficient  way  for  the  active- research  chemist  to  keep  up- 
to-date  on  recent  developments  in  his  field. 

^  Since  196j  tne  titles,  ke.ywords,  author  names  and  other  bib- 
liographic data  of  all  articles  covered  by  the  major  chemical  ab- 
stract and  title  publications— Chemical  Abstracts  (OA),  Chemical 
Titles  (CI)  and  Science  Citation  Jkidex  (SlI)— have  been  stored  on 
maqnetif:  taoe,  and  these  tapes  are  available  for  computer-search- 
ing against  personal  interest  profiles  prepared  uy  or  for  individ- 
ual chemists.    The  important  parameters  of  publication  frequency, 
journal  coveraae  and  time  delay  between  publication  of ♦ the  primary 
document  and  release  of  the  tape  are  summarized  in  Table  I.  Since 
April  of  this  year,  the  even  (physical,  analytical,  macromol- 
acular  and  apolied'  chemistry)  and  odd  issues  (organic  and  oio- 
cnei.iistr^  0^  OA  can  be  searched  separately. 

To  review  briefly  the  ter^iinology  used  in  writing  a  "search 
profile",  independent  ciiemical  concepts  become  profile 
"parameters"  which  are  linked  together  by  i\AO  logic.    In  the 
example  shown  in  Figure  1  which  attempts  to  retrieve  aU  current 
articles  on  ".lydroqen  oondinn  in  iial o-alcofiols"  HYJKOGlN  oOi^JliiG, 
.i.Mvrou.;  ajj  ALCvl.iJL  represent  tnre'=^  different  parameters,  each 
^arar^of^r  nay  consisl:  of  sevfiral  profile  "terms"  whicii  represent 
closely  relat'^J  words,  ^»irasps  or  synonyms  witnin  the  same  con- 
cept or  oaraneter  and  w.iicfi  are  link3d  uy  OK  logic.    In  addition, 
t<uT-  lo'iic  may  oe  uspd  to  -xclud^,  certain  terms.    A^star  at  tne  , 
u^ninnin^i  or  end  of  a  ter.i  snows  tn^  facility  of  front  or  iiwl 
truncation  w.iici  is  used  to  avoiu  tiie  listing  of  many  words  witn 
the  same  root.    This  specif ic- prof ilf^  will  retrieve  all  paoer^ 
whiCii  contain  in  t.ieir  titla  czt  least  one  term  of  each  parameter. 
i'r>c  computer  printout  (Fiqur^  2)  liscs  n.ie  words  or  phrases 
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leading  to  retrieval,  the  complete  bibliographic  reference  (muti- 
lated author  names,  title,  journal,  page  numbers,  etc.),  abstract 
number  (in  case  of  CAC)  and  number  of  the  CA  or  CT  issue  searched. 
This  printout  which  is  in  convenient  card  form  for  direct  filing 
is  mailed  to  the  chemist  weekly  (CAC,  ISI)  or  biweekly  (CT). 

One  of  the  factors  contributing  to  the  rajther  poor  accept- 
ance of  computer-based  search  techniques, by  academic  scientists 
is  undoubtedly  their  belief  that  the  translation  of  a  research 
topic  into  computer-readable  form  is  a  complicated  process' which 
may  require  the  learning  of  some  computer  language.    Quite  to  the 
contrary,  profile  writing  is  extremely  easy  (see  Figure  1),*  and 
the  coding  rules  can  be  mastered  by  any  scientist  in  a  matter  of 
an  hour.    And  if  he  does  not  wish  to  do  his  own  coding,  nearly 
every  Canadian  university ^library  has  one  or  more  staff  members 
who  have  received  training  as  "search  editors"  and  can  provide 
expert  assistance.    At  the  present  time,  the  science  librarians  of 
the  three  universities  in  British  Columbia  are  considering  a  joint 
full-time  appointment  of  such  a  person.    All  the  individual 
scientist  has  to  provide  is  a  clear  narrative  description  of  his 
search  request^  a  list  of  10-20  complete  bibliographic  references 
from  his  file  of  relevant  reprints  and  perhaps  a  number  of  syn- 
onyms for  certain  specialized  terms  which  may  be  used  by  other 
chemists  in  the  field. 

The  staff  of  CAS  (Chemical  Abstracts  Service)  and  NSL 
(National  Science  Library)  are  continually  modifying  their  com- 
puter programs  in  order  to  simplify  coding  even  further  and  to 
add  versatility  to  their  search  service.    Recent  improvements 
include: 

a)  Elimination  of  tne*  distinction  between  title  (T)  and  keyword 
(K)  term^  the  latter  of  which  were  used  exclusively  for 
searching  the  keyword  indices  of  the  weekly  CA  issues.  Thus 
in  order  to  search  both  titles  and  index,  each  term  nad  to 
be  listed  twice      at  twice  the  cost.    A?:  a  corrolary, 
front  truncation  and  the  use  of  phrases  are  now  allowed 

for  all  search  terms. 

b)  In  Addition  to  author  names,  titles  and  keywords,  type  of 
publication  (Books,  conference  proceedings,  dissertations, 
journal  articles,  patents,  technical  reports),  language 
and  CA  section  codes  are  now  searchable. 

In  order  to  evaluate  the' results  of  searching  various  data 
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.  bases  against  personal  interest  profiles  and  to  syggest  possible 
profile  improvements,  NSL  mails  to  each  subscriber  a  "response 
form"  (Figure  3),  i.e.,  a  duplicate  printout  af  each  retrieved 
reference  which  provides  three  spaces  for  judging  the  relevancy 
of  the  reference:    YES,  NO  or  CANNOT  TELL.    These  response  forms, 
if  regularly  returned  to  Ottawa,  are  analyzed  by  NSL  to  measure  the 
profile  efficiency  in  terms  of  its  "precision"  which  is  defined 
as  the  ratio  of  relevant-hits  (a  "hit"  is  a  computer-retrieved 
reference)  to  the  total  number  of  hits.    The  five  profiles  pre- 
pared by  11  faculty  members  of  the  Chemistry  Department  of  Simon 
Frassr  University  (SFU)  and  searched  in  the  CAiVSDI  Project  have 
been  found -to  give  an  average  25%  precision  on  a  total  average 
weekly  printout  of  20  references  per  search  question,  I.e., 
five  relevant  papers  each  week.    The  so-called  "recall"  ratio 
wnicn  is  obtained  by  dividing  the  number  of  relevant  papers 
retrieved  by  the  total  number  of  relevant  papers  present  on  tne 
uata  base  nay  be  used  as  another  quantitative  measure  of  profile 
efficiency.   Although  this  ratio  is  hard  to  measure,  regular 
Manual  checking  of  the  tables  of  content  of  the  2b  most  ijnportant 
journals  publishing  in  my  research  area  indicates  fairly  high 
r-ecall  (approximately  6b-^)  although  my  interests  cover  tne  wide 
fiffld  of  carbonyl  condensation  reactions  wnich  cannot  be  easily 
described  in  GO  profile  terms. 

This  result  shows  clearly  one  of  the  major  advantages  of 
coi'iputer  based  search  techniques  over  manual  methods:  By 
simply  scanning  approximately  20  references  arriving  on  my  desk 
.?v«rv  veek,    I  discover  about  four  out  of  every  sixVelevant 
paoers  puDlisned  in  ny  research  area  (assuming  20'«  precision  and 
o7,o  recall)  and,  even  more  important,  this  includes  articles  from 
some  of  the  most  obscure  chenical  journals.    Most  of  the  un- 
retneved  third  of  relevant  papers  are  generally  picked  up  by 
cieckinj  cited  r'^f^rences,  manual  scanning  of  tables  of  content 
anu  r-jaainq  of  a  fow  major  journals. 

Comparinq  Ua"  tnree  major  chemical  data  bases  with  respect 
CO  rocall  ratio  (extent  of  journal  cover-iqa),  currency  and  con- 
v-menc^,  I  wouIj  rate  CaC  (Chemical  Abstracts  Condensates)  first 
Mltiiouqn  LA  IS  less  current  tnan  CT  by  aDout  10  weeks,  it  offers 
essential ly  complete  coveraq°  of  tno  fislu  and  tne  convenience  of 
liscino  tip  abstract  nuhioer  for  ii.n.iediate  reference  to  mors 
o"taileJ  information  tnan  revealed  yr^a  title.    Since  life 
science  (biology,  neaicine,  etc.)  jcfurnSils  are  strongly  represented 
on  uhG  ISI  (Institutp  for  5c1entific\nl\)rraation,  Philadelpnia) 
tajies,  tins  Jata  uase  is  particular! y^iVandgeous  for  biochemists 
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Thus  only  one  question  remains  to  be  answered:    Can  an 
academic  scientist  financially  afford  a  subscription  to  a 
computer-based  alerting  service  in  these  times  of  tight  budgets? 
Subscription  to  the  source  tapes  is  clearly  cost-prohibitive  for  any 
single  university.    However,  searching  these  data  bases  against 
a  large  number  of  personal  interest  profiles  at  independent  re- 
gional information  centres  like  NSL,  Ottawa,  is  financially  feasiole. 
According  to  the  new  NSL  pricing  schedule  which  is  based  on  approxi- 
mately 700  subscriptions,  regular  weekly  Searching  of  CAC  against 
a  60-tenn  profile  will  cost  $117  annually.    At  SFU,  we  have  cut  the 
cost  to  the  individual  even  further  by  combining  the  search  re- 
quests of  several  people  on  one  profile.    However,  this  technique 
works  only  if  the  interests  of  the  cooperating  chemists  are  di- 
vergent enough  to  avoid  relevant  computer  printouts,  ending  up  1n 
one  mail  box  only  when  they  really  would  have  been  of  interest  to 
two  or  three  participants.   Thus  with  three  contributors  to  one 
60-tenn  profile  the  entire  CAC  data  base  can  be  searched  weekly 
for  less  than  $40  per  person  per  year.    Weekly  , searching  of  the 
Science  Citation  ^ndex  for  all  papers  which  have  cited  one 
specific  target  reference  would  cost  an  additional  $3.12  per  year. 
The  subscription  rates  for  the  ISI  Source  tape  are  comparable 
while  CT  searching  is  considerably  less  expensive. 

I  am  certain  that  in  the  near  future  additional  data  bases 
will  become  available  for  searching  and  that  wider  acceptance  of 
SJI  services  by  academic  and  industrial  chemists  will  lower  the 
costs  even  further.    The  next  step  I  can  envisage  for  the  devel- 
opment of  these  techniques  is  the  searching  ahd  printing  of  entire 
abstracts  instead  of  bibliographic  references  alone.    The  possi- 
bility of  using  computer-based  services  to  keep  up-to-date  on 
recent  advances  in  tne  field  of  chemical  educatiop  or,  for  tnat 
mavtter,  in  the  field  of  computer-based  methods  of  information 
retrieval  should  not  be  overlooked. 
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TABLE  1 


Tape  Service  Characteristics  of  Major  Chemical 
*     Data  Bases 
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CT 
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4  weeks 
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Journal  coverage 

Titles  per  issue 

Average  time 
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Searchaole  elements: 

Title  ^ 
iCeyv/or'd 

Author  A 

Journal 

Citation 


yeekly 

iVooo 

\ 
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X 
X 
X 
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X 
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Weekly 
2000* 
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X 
X 
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*  all  fields  of  science 
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FIGURE  2 

Computer  Printout  of  Bibliographic  Data 

♦ 

hydroxy,  enol,  chlor.  hydrogen  bond" 
perelygi,  i.s..ukhunov.  t.f. 
infrared  spectra  and  hydrogen  bonding  of  hydroxyls 
of  chlorosubstituted  phenols  . 

optika  i  spectroskopiya 

volume  0029,  issue  0003,  year  1970,  page  516-18 
phenols**chlorinated**ir**hydrogen**bonding**c'hloro**' 

AN   007977  0803  €N  14  C0274 
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FIGURE  3 

Computer  Printout  Response  Form 

*  "  —  ■  ■    ~    ™       "  ~ 

HYDROXY,  ENOL,  CHLOR,- HYDROGEN  BOND 
PEREL,YGI    I.S.,  AKHYNOV,  T»F. 

INFRARED 'spectra  AND  HYDROGEN  BONDING  OF  HYDROXYLS 
»  OF  CHLOROSUBSTITUTED  PHENOLS. 

.  OPTIKA  I  SPECTROSKOPIYA 
VOLUME    0029,  ISSUE  0003.  YEAR  1970.  PAGE  516-18 

PHENOLS<*CHLORINATED**IR**HYDROGEN**BONDING**CHLORO** 

r 

IS  THIS  CITATION  USEFUL?    YES     NO     CAf^NOT  TELL 

AN    007977  0803  EN    U  C0274  >■ 
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Use  of  the  ASTM-IR  File  for  Education 

Joseph  R.  Denk 
*  Curriculinn  Development  Manager  ' 

North  Carolina  Educational  Computing  Service 
P.  0.  Box  12175 

Research  Triangle  Park,  Morth  Carolina  27709 


The  Dow-ASTM  file  of  coded  infrared  spectra  for  92,000  com- 
pounds has  become  available  through  several  commercial  sources^. 
These  commercial  sources  offer  a  service  which  includes  a  running 
software  package  to  do  searches  on  the  file  itself.    To  the  know- 
ledge of  this  author,  all  of  these  services  are  limited  to 
searches  in  which  a  infrared  spectrum  is  "identified"  from  a  com- 
bination of  coded  peaks  fo;:  tl^unknown  spectrum  and  a  limited 
amount  of  structural  information  of  the  unknown  material.*  The 
file  itself,  once  available,  allows  much  more  versatility  in 
searching  and  several  possibilities  for  educational  innovation. 

The  American  Society  for -Testing  and  Materials,  ASTM,  can 
supply  the  original  file  of  92,000  compounds  and  an  update  of  50,000 
additional  compounds.    This  data  base  can  be  described  as  having 
three  categories  of  information  for  each  entry: 

(a)  Coded  spectral  data, 

(b)  Structural  Information, 

(c)  Other  chemical  information.  > 

Since  the  file  has  both  inorganic  and  organic    entries,  the  infor-c 
mation  in  each  -category  differs  for  these  two  general  chemical 
classifications. 

For  organic  compounds the  infrared  spectral  coding  is  avail- 
able for  bands  having^ over  30  percent  absorption  in  the  range  of  ^ 
1.0  to*16.0  microns.    Structural  information  for  each  entry  is  sum- 


^Among  these  are  Sadtler  Corporation,  Singer  Corporation,  and  Dow 
Chemical  Corporation. 
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marized  in  Table  !•  ^her  che&ical  information  includes  melting 
or  boiling  points  and  empirical  formula  data. 

Inorganic  com^H^Cinds  have  spectral  coding  from  11,0  to  35.0 
microns  for  bands  over  30  percent  absorbance.    Structural  infor- 
mation is  limited  to  the  presence  of  92  possible  Cations  and  75 
possible  radicals.    Again,  melting  or  boiling  point  information 
as  well  as  empirical  formula  data  are  available. 

Besides  the  pbtrious  usage  of  such  a  file  for  organic  quali- 
tative analysis  .b"3^js^archirig,  the  large  amount  of  structural 
information  suggested  to  the  author  several  educational  possi- 
bilities.   An  inversion  of  the  searching  technique,  i.e.j  going 
from  structural  it^formation  alone  to  spectra,  would  allow  semi- 
quantitative studied  in  the  effects  of  electron-feeding  groups, 
electron-withdrawing  groups,  steric  hindrance,  etc.,  or  an 
activ^  infrared  site.    Band  shifts  and  complications  would  be 
available  from  a  Qompound  "constructed"  and  searched  from  the 
'file,  of  published  spectra.    These  possiblities  are  relevant 
to  chemistry  courses  covering  structural  theory.  Spectroscopy, 
chemical  literatu^€i>  and  undergraduate  research  besides  those 
courses  involving  organic  or  inorganic  qualitative  analysis. 

The  author  obtained  the  Dow-ASTM  file  and  constructed  a 
searching  system  for  searching  the  file  with  any  combination 
of  input  information  to  produce  any  combination"* of  file  infor- 
mation.   This  system,  ^Ii^frared  Spectral  Information  Sy&tem, 
ISIS,  allows  organic  qualitative  analysis  through  spectral 
information  or  through  a  combination  of  spectral  and  structural 
information.    ISIS  also  allows  pulling  out  spectra  that  match  a 
combination  of  structural  parameters  designed  by  the  user. 
Further,  ISIS  allows  any  combination  of  these  approaches. 
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TABLE  1 

Structural  Information  for  Organic  Compounds 
in  the  ASTM  Infrared  File 


.General 

Specific 
DymDoiic  ir roups 

PnQ"f  ^  "f  nr\  f  o\             TTt^  a  a  ^iiT*a  ryry 
^  v'OX  L  XULl       /     UJL     UIlocl  LUJL  Cl  LxUn 

in  Chains 

uxygen  croups 

Position's)  of  Unsaturation 
in  Rings 

Nitrogen  Groups 

Presence  of  Unsaturation 

Oxygen-Nitrogen  Groups 

Structural  Type 

Sulfur  Groups 

Ring  Size 

Sulfur -Nitrogen  Groups 

Chain  Length 

Sulfur-Oxygen  Groups 

Number  of  Rings 

Sulfur-Oxygen-Nitrogen  Groups 

Position(s)  of  Substitution 

Position(s)  of  Substitutes 
in  Ring& 

Physical  State 

Direction  of  Rotation 

Cis--Trans  Isomerism 

Presence  of  Spiro  Arrangement 

T^pe  ot  Alkyl  Group 

Type  of  Alkenyl  Group 

Type  of  Aikynyl  Group 

Presence  of  Conjugation 

Pres*§nce  of  Heterocyclic  Compounds 

% 
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Five  general  statements  are  involved  in  ISIS  for  searching: 
RUNTYPE,  OPTIONS,  INCLUDE/ EXCLUDE,  NOBAND,  and  SPECTRUM.  The 
RUNTYPE  statement  allows  a  search  through  spectra  for  identif icatio 
a  search  through  structure  for  spectra,  and  combinations  of  these 
approaches.    OPTIONS  includes  wiggling  spectra  during  searches, 
subletting  the  file,  pointing  out  the  coded  spectral  information 
on  hits,  getting  the  names  of  the  hits  (instead  of  serial  numbers), 
and  requesting  structural  information*    INCLUDE/ EXCLUDE  state- 
ment allows  the  addition  of  structural  information  to  spectral 
searches •    Spectral  searches  use  the  NOBAND  and  SPECTRUM  state- 
ments to  input  spectral  information.  ^ 

During  the  Spring  of  1971,  when  ISIS  first  became  available, 
the  structure  to  spectra  searching  was  not  implemented  and  usage 
was  restricted  to  organic  qualitative  analysis.    Structural  theory 
studies  were  not  attempted  except,  in  testing  beginning  software. 
ISIS  is  now  almost  finished  and  structural  studies  will  take  place 
in  th6  fall  of  1971.    To  aid. in  this  work,  Charles  N.  Reilly  has 
studied  the  cleanliness  of  the  file  for  structural  information. 
These  results  are  yet  to  be  published. 

ISIS  can  do  4  search  for  less  than  $1.00.    This  factor  made 
the  production  of  another  system  (for  IBM  360  Models  50  ^nd  up) 
worthwhile  educationally  since  commercial  searches  ate  ten  times 
more  expensive.    Information  on  the  availability  of  the,  softwaife 
(searching  system  in  360  ASSEMBLER  and  I/O  in  PL/I)  can  be  obtained 
from  the  author. 
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